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SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES ENCODED THEREBY 

This application claims priority under 35 USC §1 19(e), §1 19(a-d) and §120 of the following 
application, the entire contents of which are hereby incorporated by reference: 
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8/20/1999 


United States 


2750-0540P 


80143.001 


60/149,930 


8/23/1999 


United States 


2750-0543P 


00088.001 


60/149,902 


8/23/1999 


United States 


2750-0544P 


00089.001 


60/150,566 


8/25/1999 


United States 


2750-0547P 


00090.001 


60/150,884 


8/26/1999 
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Country 


Attorney No 
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Application No 


Filing Date 


United States 


2750-0545P 


80144.001 


60/151,065 


8/27/1999 


United States 


2750-0546P 


80144.002 


60/151,066 


8/27/1999 


United States 


2750-0548P 


00091.001 


60/151,080 


8/27/1999 


United States 


2750-0549P 


00092.001 


60/151,303 


8/30/1999 


United States 


2750-0552P 


00093.001 


60/151,438 


8/31/1999 


United States 


2750-0553P 


00094.001 


60/151,930 


9/1/1999 


United States 


2750-0554P 


00095.001 


60/152,363 


9/7/1999 


United States 


2750-0555P 


00096.001 


60/153,070 


9/10/1999 


United States 


2750-0556P 


00098.001 


60/153,758 


9/13/1999 


United States 


2750-0557P 


00099.001 


60/154,018 


9/15/1999 


United States 


2750-0558P 


00101.001 


60/154,039 


9/16/1999 


United States 


2750-0559P 


00102.001 


60/154,779 


9/20/1999 


United States 


2750-0560P 


00103.001 


60/155,139 


9/22/1999 


United States 


2750-056 IP 


00104.001 


60/155,486 


9/23/1999 


United States 


2750-0562P 


00105.001 


60/155,659 


9/24/1999 


United States 


2750-0563P 


00106.001 


60/156,458 


9/28/1999 


United States 


2750-0564P 


00107.001 


60/156,596 


9/29/1999 


United States 


2750-0570P 


00108.001 


60/157,117 


10/4/1999 


United States 


2750-0571P 


00109.001 


60/157,753 


10/5/1999 


United States 


2750-0572P 


00110.001 


60/157,865 


10/6/1999 


United States 


2750-0575P 


00111.001 


60/158,029 


10/7/1999 


United States 


2750-0576P 


00112.001 


60/158,232 


10/8/1999 


United States 


2750-0577P 


00113.001 


60/158,369 


10/12/1999 


United States 


2750-0579P 


80146.002 


60/159,293 


10/13/1999 


United States 


2750-0583P 


80148.002 


60/159,294 


10/13/1999 


United States 


2750-0574P 


80145.002 


60/159,295 


10/13/1999 


United States 


2750-0582P 


80148.001 


60/159,329 


10/14/1999 


United States 


2750-0580P 


80147.001 


60/159,638 


10/14/1999 


United States 


2750-0578P 


80146.001 


60/159,331 


10/14/1999 


United States 


2750-0573P 


80145.001 


60/159,330 


10/14/1999 


United States 


2750-0581P 


80147.002 


60/159,637 


10/14/1999 


United States 


2750-0584P 


00116.001 


60/159,584 


10/18/1999 


United States 


2750-0585P 


00118.001 


60/160,815 


10/21/1999 


United States 


2750-0586P 


80149.001 


60/160,814 


10/21/1999 


United States 


2750-0587P 


80149.002 


60/160,770 


10/21/1999 


United States 


2750-0588P 


00119.001 


60/160,741 


10/21/1999 


United States 


2750-0589P 


80150.001 


60/160,768 


10/21/1999 


United States 


2750-0590P 


80150.002 


60/160,767 


10/21/1999 


United States 


2750-0593P 


80151.002 


60/160,981 


10/22/1999 


United States 


2750-0591P 


00120.001 


60/160,980 


10/22/1999 


United States 


2750-0592P 


80151.001 


60/160,989 


10/22/1999 


United States 


2750-0596P 


80152.002 


60/161,404 


10/25/1999 


United States 


2750-0594P 


00121.001 


60/161,405 


10/25/1999 


United States 


2750-0595P 


80152.001 


60/161,406 


10/25/1999 


United States 


2750-0598P 


80153.001 


60/161,360 


10/26/1999 


United States 


2750-0599P 


80153.002 


60/161,359 


10/26/1999 
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Country 
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CUent No 


Application No 


Filing Date 


United States 


2750-0597P 


00122.001 


60/161,361 


10/26/1999 


United States 


2750-0601P 


00123.001 


60/161,920 


10/28/1999 


United States 


2750-0602P 


80154.001 


60/161,992 


10/28/1999 


United States 


2750-0603P 


80154.002 


60/161,993 


10/28/1999 


United States 


2750-0606P 


80155.002 


60/162,228 


10/29/1999 


United States 


2750-0605P 


80155.001 


60/162,142 


10/29/1999 


United States 


2750-0604P 


00124.001 


60/162,143 


10/29/1999 


United States 


2750-0607P 


00125.001 


60/162,894 


11/1/1999 


United States 


2750-0608P 


80156.001 


60/162,891 


11/1/1999 


United States 


2750-0609P 


80156.002 


60/162,895 


11/1/1999 


United States 


2750-0610P 


00126.001 


60/163,093 


11/2/1999 


United States 


2750-061 IP 


80157.001 


60/163,092 


11/2/1999 


United States 


2750-0612P 


80157.002 


60/163,091 


11/2/1999 


United States 


2750-0613P 


00127.001 


60/163,249 


11/3/1999 


United States 


2750-0614P 


80158.001 


60/163,248 


11/3/1999 


United States 


2750-0615P 


80158.002 


60/163,281 


11/3/1999 


United States 


2750-0616P 


00128.001 


60/163,379 


11/4/1999 


United States 


2750-0617P 


80159.001 


60/163,381 


11/4/1999 


United States 


2750-061 8P 


80159.002 


60/163,380 


11/4/1999 


United States 


2750-0620P 


80160.001 


60/164,151 


11/8/1999 


United States 


2750-0621P 


80160.002 


60/164,150 


11/8/1999 


United States 


2750-0619P 


00129.001 


60/164,146 


11/8/1999 


United States 


2750-0623P 


80161.002 


60/164,260 


11/9/1999 


United States 


2750-0625P 


80162.002 


60/164,259 


11/9/1999 


United States 


2750-0624P 


80162.001 


60/164,317 


11/10/1999 


United States 


2750-0622P 


80161.001 


60/164,319 


11/10/1999 


United States 


2750-0626P 


80163.001 


60/164,321 


11/10/1999 


United States 


2750-0627P 


80163.002 


60/164,318 


11/10/1999 


United States 


2750-0628P 


00131.001 


60/164,544 


11/10/1999 


United States 


2750-0629P 


80164.001 


60/164,545 


11/10/1999 


United States 


2750-0630P 


80164.002 


60/164,548 


11/10/1999 


United States 


2750-0633P 


80165.002 


60/164,960 


11/12/1999 


United States 


2750-0632P 


80165.001 


60/164,871 


11/12/1999 


United States 


2750-0634P 


00133.001 


60/164,870 


11/12/1999 


United States 


2750-0635P 


80166.001 


60/164,959 


11/12/1999 


United States 


2750-0636P 


80166.002 


60/164,962 


11/12/1999 


United States 


2750-063 IP 


00132.001 


60/164,961 


11/12/1999 


United States 


2750-0637P 


00134.001 


60/164,927 


11/15/1999 


United States 


2750-0638P 


80167.001 


60/164,929 


11/15/1999 


United States 


2750-0639P 


80167.002 


60/164,926 


11/15/1999 


United States 


2750-0642P 


80168.002 


60/165,661 


11/16/1999 


United States 


2750-0640P 


00135.001 


60/165,669 


11/16/1999 


United States 


2750-0641P 


80168.001 


60/165,671 


11/16/1999 


United States 


2750-0645P 


80169.002 


60/165,911 


11/17/1999 


United States 


2750-0644P 


80169.001 


60/165,918 


11/17/1999 


United States 


2750-0643P 


00136.001 


60/165,919 


11/17/1999 
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Country 


Attorney No 


CUent No 


Application No 


Filing Date 


United States 


2750-0647P 


80170.001 


60/166,173 


11/18/1999 


United States 


2750-0648P 


80170.002 


60/166,158 


11/18/1999 


United States 


2750-0646P 


00137.001 


60/166,157 


11/18/1999 


United States 


2750-0650P 


80171.001 


60/166,411 


11/19/1999 


United States 


2750-065 IP 


80171.002 


60/166,412 


11/19/1999 


United States 


2750-0649P 


00139.001 


60/166,419 


11/19/1999 


United States 


2750-0652P 


00140.001 


60/166,733 


11/22/1999 


United States 


2750-0653P 


80172.001 


60/166,750 


.11/22/1999 


United States 


2750-0655P 


80173.002 


60/167,362 


11/23/1999 


United States 


2750-0654P 


80173.001 


60/167,382 


11/24/1999 


United States 


2750-0656P 


00141.001 


60/167,233 


11/24/1999 


United States 


2750-0657P 


80174.001 


60/167,234 


11/24/1999 


United States 


2750-0658P 


80174.002 


60/167,235 


11/24/1999 


United States 


2750-0659P 


00142.001 


60/167,904 


11/30/1999 


United States 


2750-0660P 


80175.001 


60/167,908 


11/30/1999 


United States 


2750-0661P 


80175.002 


60/167,902 


11/30/1999 


United States 


2750-0664P 


80176.001 


60/168,233 


12/1/1999 


United States 


2750-0665P 


80176.002 


60/168,231 


12/1/1999 


United States 


2750-0663P 


00143.001 


60/168,232 


12/1/1999 


United States 


2750-0667P 


80177.001 


60/168,549 


12/2/1999 


United States 


2750-0668P 


80177.002 


60/168,548 


12/2/1999 


United States 


2750-0666P 


00144.001 


60/168,546 


12/2/1999 


United States 


2750-0669P 


00145.001 


60/168,675 


12/3/1999 


United States 


2750-0670P 


80178.001 


60/168,673 


12/3/1999 


United States 


2750-0671P 


80178.002 


60/168,674 


12/3/1999 


United States 


2750-0672P 


00147.001 


60/169,298 


12/7/1999 


United States 


2750-0673P 


80179.001 


60/169,278 


12/7/1999 


United States 


2750-0674P 


80179.002 


60/169,302 


12/7/1999 


United States 


2750-0675P 


80180.001 


60/169,692 


12/8/1999 


United States 


2750-0676P 


80180.002 


60/169,691 


12/8/1999 


United States 


2750-0678P 


80181.001 


60/171,114 


12/16/1999 


United States 


2750-0679P 


80181.002 


60/171,098 


12/16/1999 


United States 


2750-0677P 


00149.001 


60/171,107 


12/16/1999 


International 


2750-0686F(PC) 


80070.100 


00/004,66 


1/7/2000 


United States 


2750-0684P 


80070.002 


09/479,221 


1/7/2000 


United States 


2750-068 IP 


80182.002 


60/176,866 


1/19/2000 


United States 


2750-0685P 


80183.002 


60/176,867 


1/19/2000 


United States 


2750-0688P 


80184.002 


60/176,910 


1/20/2000 


United States 


2750-0689P 


00152.001 


60/178,166 


1/26/2000 


United States 


2750-0682P 


80183.001 


60/178,546 


1/27/2000 


United States 


2750-0687P 


80184.001 


60/178,545 


1/27/2000 


United States 


2750-0690P 


00153.001 


60/178,547 


1/27/2000 


United States 


2750-0691P 


80185.001 


60/177,666 


1/27/2000 


United States 


2750-0680P 


80182.001 


60/178,544 


1/27/2000 


United States 


2750-0692P 


00155.001 


60/178,754 


1/28/2000 
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United States 


2750-0693P 


80186.001 


60/178,755 


1/28/2000 


United States 


2750-0695P 


00157.001 


60/179,395 


2/1/2000 


United States 


2750-0696P 


80187.001 


60/179,388 


2/1/2000 


United States 


2750-0698P 


80188.001 


60/180,139 


2/3/2000 


United States 


2750-0697P 


00158.001 


60/180,039 


2/3/2000 


United States 


2750-0699P 


00159.001 


60/180,206 


2/4/2000 


United States 


2750-0700P 


80189.001 


60/180,207 


2/4/2000 


United States 


2750-0701P 


00160.001 


60/180,695 


2/7/2000 


United States 


2750-0702P 


80190.001 


60/180,696 


2/7/2000 


United States 


2750-0703P 


00161.001 


60/181,228 


2/9/2000 


United States 


2750-0704P 


80191.001 


60/181,214 


2/9/2000 


United States 


2750-0705P 


00162.001 


60/181,476 


2/10/2000 


United States 


2750-0706P 


80192.001 


60/181,551 


2/10/2000 


United States 


2750-0712P 


00164.001 


60/182,512 


2/15/2000 


United States 


2750-0707P 


00163.001 


60/182,477 


2/15/2000 


United States 


2750-0713P 


80194.001 


60/182,478 


2/15/2000 


United States 


2750-0708P 


80193.001 


60/182,516 


2/15/2000 


United States 


2750-0714P 


00165.001 


60/183,166 


2/17/2000 


United States 


2750-0715P 


80195.001 


60/183,165 


2/17/2000 


iviexico 


z, / j i \jyr ^iviyv ) 


80090 1 01 

Ivl 


00/001 973 


2/25/2000 


Europe 


2750-0709F(EP) 


80090.103 


03/014,396 


2/25/2000 


United States 


2750-0709P 


80090.002 


09/513,996 


.2/25/2000 


Canada 


2750-0709F(CA) 


80090.102 


23/006,92 


2/25/2000 


United States 


2750-0783P 


91000.002 


09/543,680 


4/6/2000 


United States 


2750-0788P 


80123.002 


09/559,232 


4/28/2000 


United States 


2750-0852P 


80126.002 


09/566,262 


5/4/2000 


United States 


2750-085 IP 


91002.002 


09/565,308 


5/5/2000 


United States 


2750-0855P 


80130.002 


09/565,310 


5/5/2000 


United States 


2750-0853P 


80127.002 


09/565,309 


5/5/2000 


United States 


2750-0854P 


80129.002 


09/565,307 


5/5/2000 


United States 


2750-0871P 


80131.002 


09/572,408 


5/11/2000 


United States 


2750-0873P 


00025.002 


09/570,582 


5/12/2000 


United States 


2750-0875P 


91006.002 


09/570,581 


5/12/2000 


United States 


2750-0874P 


80116.002 


09/570,738 


5/12/2000 


United States 


2750-0872P 


80117.002 


09/570,768 


5/12/2000 


United States 


2750-0876P 


91007.002 


09/573,655 


5/18/2000 


United States 


2750-0928P 


00033.003 


09/592,459 


6/9/2000 


United States 


2750-0934P 


00034.002 


09/593,710 


6/14/2000 


United States 


2750-0953P 


80132.022 


09/595,298 


6/16/2000 


United States 


2750-0946P 


80132.015 


09/595,328 


6/16/2000 


United States 


2750-0949P 


80132.018 


09/595,332 


6/16/2000 


United States 


2750-0948P 


80132.017 


09/595,329 


6/16/2000 


United States 


2750-0954P 


80132.023 


09/594,599 


6/16/2000 


United States 


2750-0941P 


00037.002 


09/595,334 


6/16/2000 
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United States 


2750-0955P 


80132.024 


09/595,331 


6/16/2000 


United States 


2750-0947P 


80132.016 


09/595,335 


6/16/2000 


United States 


2750-0945P 


80132.014 


09/595,333 


6/16/2000 


United States 


2750-0943P 


00039.002 


09/596,577 


6/16/2000 


United States 


2750-0942P 


00038.002 


09/595,326 


6/16/2000 


United States 


2750-095 IP 


80132.020 


09/594,595 


6/16/2000 


United States 


2750-0944P 


80132.013 


09/595,330 


6/16/2000 


United States 


2750-0950P 


80132.019 


09/594,598 


6/16/2000 


United States 


2750-0952P 


80132.021 


09/594,597 


6/16/2000 


United States 


2750-0971P 


00042.003 


09/602,660 


6/21/2000 


United States 


2750-0972P 


00043.002 


09/602,152 


6/22/2000 


United States 


2750-0973P 


00044.002 


09/602,025 


6/23/2000 


United States 


2750-0975P 


00045.002 


09/602,016 


6/23/2000 


United States 


2750-0976P 


00046.002 


09/605,843 


6/28/2000 


United States 


2750-0977P 


00048.002 


09/606,181 


6/29/2000 


United States 


2750-0979P 


00050.002 


09/607,081 


6/30/2000 


United States 


2750-0978P 


00049.002 


09/608,960 


6/30/2000 


United States 


2750-0980P 


00051.002 


09/610,157 


6/30/2000 


United States 


2750-0981P 


00052.002 


09/609,198 


6/30/2000 


United States 


2750-0982P 


00053.002 


09/611,409 


7/6/2000 


United States 


2750-0983P 


00054.002 


09/612,645 


7/7/2000 


United States 


2750-0984P 


00058.002 


09/613,547 


7/7/2000 


United States 


2750-0985P 


00059.002 


09/615,007 


7/12/2000 


United States 


2750-0986P 


00060.002 


09/615,748 


7/13/2000 


United States 


2750-0987P 


00061.002 


09/617,525 


7/14/2000 


United States 


2750-0988P 


00062.002 


09/617,203 


7/14/2000 


United States 


2750-1061P 


80134.018 


09/614,450 


7/14/2000 


United States 


2750-1060P 


80134.017 


09/614,388 


7/14/2000 


United States 


2750-1064P 


80134.024 


09/617,681 


7/19/2000 


United States 


2750-1063P 


80134.022 


09/617,682 


7/19/2000 


United States 


2750-1062P 


80134.020 


09/617,683 


7/19/2000 


United States 


2750-0989P 


00064.002 


09/620,421 


7/19/2000 


United States 


2750-1066P 


80135.004 


09/620,978 


7/20/2000 


United States 


2750-1065P 


80134.026 


09/620,998 


7/20/2000 


United States 


2750-0990P 


00065.002 


09/621,323 


7/20/2000 


United States 


2750-1069P 


80134.023 


09/620,313 


7/21/2000 


United States 


2750-1068P 


80134.016 


09/620,393 


7/21/2000 


United States 


2750-0991P 


00066.002 


09/621,630 


7/21/2000 


United States 


2750-1071P 


80134.021 


09/620,390 


7/21/2000 


United States 


2750-1067P 


80134.015 


09/620,394 


7/21/2000 


United States 


2750-1072P 


80135.003 


09/621,900 


7/21/2000 


United States 


2750-0992P 


00067.002 


09/621,660 


7/21/2000 


United States 


2750-1070P 


80134.019 


09/620,111 


7/21/2000 


United States 


2750-0993P 


00069.002 


09/621,902 


7/21/2000 


United States 


2750-1073P 


80134.025 


09/620,314 


7/21/2000 . 


United States 


2750-0994P 


00070.002 


09/616,628 


7/26/2000 
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United States 


2750-1075P 


80136.005 


09/628,987 


7/27/2000 


United States 


2750-1074P 


80136.004 


09/628,984 


7/27/2000 


United States 


2750-0995P 


00071.002 


09/628,986 


7/27/2000 


United States 


2750-0996P 


00072.002 


09/628,552 


7/28/2000 


United States 


2750-1076P 


80137.003 


09/628,985 


8/2/2000 


United States 


2750-1077P 


80137.004 


09/630,442 


8/2/2000 


United States 


2750-0997P 


00073.002 


09/632,340 


8/2/2000 


United States 


2750-0998P 


00074.002 


09/632,349 


8/3/2000 


United States 


2750-1001P 


00079.002 


09/633,239 


8/4/2000 


United States 


2750-1078P 


80138.003 


09/635,640 


8/4/2000 


United States 


2750-1092P 


80138.004 


09/635,643 


8/4/2000 


United States 


2750-0999P 


00076.002 


09/633,051 


8/4/2000 


United States 


2750-1000P 


00077.002 


09/633,191 


8/4/2000 


United States 


2750-1002P 


00080.002 


09/635,277 


8/9/2000 


United States 


2750-1094P 


80139.004 


09/635,642 


8/9/2000 


United States 


2750-1093P 


80139.003 


09/635,641 


8/10/2000 


United States 


2750-1005P 


00083.002 


09/637,563 


8/11/2000 


United States 


2750-1004P 


00082.002 


09/636,555 


8/11/2000 


United States 


2750-1096P 


80142.004 


09/637,780 


8/11/2000 


United States 


2750-1003P 


00081.002 


09/637,837 


8/11/2000 


United States 


2750-1095P 


80142.003 


09/643,672 


8/16/2000 


United States 


2750-1006P 


00084.002 


09/641,198 


8/17/2000 


United States 


2750-1007P 


00085.002 


09/641,359 


8/18/2000 


United States 


2750-1008P 


00086.002 


09/641,375 


8/18/2000 


United States 


2750-1098P 


80143.004 


09/643,671 


8/18/2000 


United States 


2750-1009P 


00087.002 


09/640,695 


8/18/2000 


United States 


2750-1010P 


00088.002 


09/643,854 


8/23/2000 


United States 


2750-1097P 


80143.003 


09/649,866 


8/23/2000 


United States 


2750-1205P 


80289.079 


60/228,053 


8/25/2000 


United States 


2750-1 171P 


80289.045 


60/228,152 


8/25/2000 


United States 


2750-1203P 


80289.077 


60/227,939 


8/25/2000 


United States 


2750-121 IP 


80289.085 


60/228,161 


8/25/2000 


United States 


2750-1210P 


80289.084 


60/228,164 


8/25/2000 


United States 


2750-1209P 


80289.083 


60/228,054 


8/25/2000 


United States 


2750-1208P 


80289.082 


60/228,189 


8/25/2000 


United States 


2750-1206P 


80289.080 


60/227,978 


8/25/2000 


United States 


2750-1204P 


80289.078 


60/227,955 


8/25/2000 


United States 


2750-1207P 


80289.081 


60/227,982 


8/25/2000 


United States 


2750-1212P 


80289.086 


60/228,165- 


8/25/2000 


United States 


2750-1213P 


80289.087 


60/228,221 


8/25/2000 


United States 


2750-1214P 


80289.088 


60/228,240 


8/25/2000 


United States 


2750-1215P 


80289.089 


60/227,979 


8/25/2000 


United States 


2750-1216P 


80289.090 


60/227,954 


8/25/2000 


United States 


2750-1217P 


80289.091 


60/228,217 


8/25/2000 


United States 


2750-1218P 


80289.092 


60/227,929 


8/25/2000 


United States 


2750-1219P 


80289.093 


60/228,043 


8/25/2000 
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Attorney No 
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Filing Date 


United States 


2750-1220P 


80289.094 


60/227,931 


8/25/2000 


United States 


2750-1221P 


80289.095 


60/228,187 


8/25/2000 


United States 


2750-1 133P 


80289.018 


60/228,025 


8/25/2000 


United States 


2750-1223P 


80289.097 


60/228,150 


8/25/2000 


United States 


2750-1 147P 


80289.032 


60/228,033 


8/25/2000 


United States 


2750-1224P 


80289.098 




8/25/2000 


United States 


2750-1222P 


80289.096 


60/228,061 


8/25/2000 


United States 


2750-1 148P 


80289.033 


60/228,024 


8/25/2000 


United States 


2750-2 133P 


80289.099 


60/228,041 


8/25/2000 


United States 


2750-1 170P 


80289.044 


60/228,132 


8/25/2000 


United States 


2750-1 169P 


80289.043 


60/228,049 


8/25/2000 


United States 


2750-1 168P 


80289.042 


60/228,052 


8/25/2000 


United States 


2750-1 167P 


80289.041 


60/228,047 


8/25/2000 


United States 


2750-1 166P 


80289.040 


60/228,098 


8/25/2000 


United States 


2750-1 165P 


80289.039 


60/228,046 


8/25/2000 


United States 


2750-1 164P 


80289.038 


60/228,163 


8/25/2000 


United States 


2750-1 152P 


80289.037 


60/227,774 


8/25/2000 


United States 


2750-1 151P 


80289.036 


60/227,725 


8/25/2000 


United States 


2750-1 145P 


80289.030 


60/227,728 


8/25/2000 


United States 


2750-1 149P 


80289.034 


60/227,769 


8/25/2000 


United States 


2750-1 134P 


80289.019 


60/227,781 


8/25/2000 


United States 


2750-1 146P 


80289.031 


60/227,773 


8/25/2000 


United States 


2750-1 144P 


80289.029 


60/227,770 


8/25/2000 


United States 


2750-1 143P 


80289.028 


60/227,730 


8/25/2000 


United States 


2750-1 142P 


80289.027 


60/227,733 


8/25/2000 


United States 


2750-1 141P 


80289.026 


60/227,792 


8/25/2000 


United States 


2750-1 HOP 


80289.025 


60/227,734 


8/25/2000 


United States 


2750-1 139P 


80289.024 


60/228,167 


8/25/2000 


United States 


2750-1 138P 


80289.023 


60/227,729 


8/25/2000 


United States 


2750-1 137P 


80289.022 


60/227,732 


8/25/2000 


United States 


2750-1 136P 


80289.021 


60/227,731 


8/25/2000 


United States 


2750-1 135P 


80289.020 


60/227,783 


8/25/2000 


United States 


2750-1 150P 


80289.035 


60/227,780 


8/25/2000 


United States 


2750-1 177P 


80289.051 


60/228,320 


8/25/2000 


United States 


2750-1 195P 


80289.069 


60/228,216 


8/25/2000 


United States 


2750-1 196P 


80289.070 


60/228,065 


8/25/2000 


United States 


2750-1 197P 


80289.071 


60/227,975 


8/25/2000 


United States 


2750-1 198P 


80289.072 


60/228,181 


8/25/2000 


United States 


2750-1 199P 


80289.073 


60/228,063 


8/25/2000 


United States 


2750-1202P 


80289.076 


60/228,074 


8/25/2000 


United States 


2750-1200P 


80289.074 


60/228,064 


8/25/2000 


United States 


2750-1201P 


80289.075 


60/228,055 . 


8/25/2000 


United States 


2750-1 173P 


80289.047 


60/228,322 


8/25/2000 


United States 


2750-1 174P 


80289.048 


60/228,156 


8/25/2000 


United States 


2750-1 100P 


80144.004 


09/649,867 


8/25/2000 


United States 


2750-1 176P 


80289.050 


60/228,133 


8/25/2000 
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United States 


2750-1 192P 


80289:066 


60/227,932 


8/25/2000 


United States 


2750-1 178P 


80289.052 


60/228,159 


8/25/2000 


United States 


2750-1 179P 


80289.053 


60/228,151 


8/25/2000 


United States 


2750-1 182P 


80289.056 


60/228,153 


8/25/2000 


United States 


2750-1 180P 


80289.054 


60/228,202 


8/25/2000 


United States 


2750-1 181P 


80289.055 


60/228,208 


8/25/2000 


United States 


2750-1013P 


00091.002 


09/645,441 


8/25/2000 


United States 


2750-1012P 


00090.002 


09/648,708 


8/25/2000 


United States 


2750-1 172P 


80289.046 


60/228,135 


8/25/2000 


United States 


2750-1099P 


80144.003 


09/649,868 


8/25/2000 


United States 


2750-101 IP 


00089.002 


09/645,440 


8/25/2000 


United States 


2750-2 132P 


80289.217 


60/227,791 


8/25/2000 


United States 


2750-1 175P 


80289.049 


60/228,323 


8/25/2000 


United States 


2750-2 130P 


80289.215 


60/227,777 


8/25/2000 


United States 


2750-21 17P 


80289.202 


60/228,031 


8/25/2000 


United States 


2750-21 18P 


80289.203 


60/228,028 


8/25/2000 


United States 


2750-21 19P 


80289.204 


60/228,027 


8/25/2000 


United States 


2750-2 120P 


80289.205 




8/25/2000 


United States 


2750-2 12 IP 


80289.206 


60/228,026 


8/25/2000 


United States 


2750-2122P 


80289.207 


60/228,038 


8/25/2000 


United States 


2750-2123P 


80289.208 


60/228,036 


8/25/2000 


United States 


2750-2124P 


80289.209 


60/227,790 


8/25/2000 


United States 


2750-2 125P 


80289.210 


60/228,039 


8/25/2000 


United States 


2750-2 126P 


80289.211 


60/228,030 


8/25/2000 


United States 


2750-2 127P 


80289.212 


60/228,032 


8/25/2000 


United States 


2750-1 194P 


80289.068 


60/228,044 


8/25/2000 


United States 


2750-2129P 


80289.214 


60/228,040 


8/25/2000 


United States 


2750-1 193P 


80289.067 


60/227,936 


8/25/2000 


United States 


2750-2131P 


80289.216 


60/228,037 


8/25/2000 


United States 


2750-1 183P 


80289.057 


60/228,179 


8/25/2000 


United States 


2750-1 184P 


80289.058 


60/228,180 


8/25/2000 


United States 


2750-1 185P 


80289.059 


60/228,209 


8/25/2000 


United States 


2750-1 186P 


80289.060 


60/228,178 


8/25/2000 


United States 


2750-1 187P 


80289.061 


60/228,177 


8/25/2000 


United States 


2750-1 188P 


80289.062 


60/227,976 


8/25/2000 


United States 


2750-1 189P 


80289.063 


60/228,207 


8/25/2000 


United States 


2750-1 190P 


80289.064 


60/228,048 


8/25/2000 


United States 


2750-1 191P 


80289.065 


60/228,096 


8/25/2000 


United States 


2750-21 16P 


80289.201 


60/227,793 


8/25/2000 


United States 


2750-2128P 


80289.213 


60/228,149 


8/25/2000 


United States 


2750-1014P 


00092.002 


09/651,370 


8/30/2000 


United States 


2750-1015P 


00093.002 


09/653,466 


8/31/2000 


United States 


2750-1016P 


00094.002 


09/654,547 


9/1/2000 


United States 


2750-1017P 


00095.002 


09/657,454 


9/7/2000 


United States 


2750-101 8P 


00096.002 


09/657,569 


9/8/2000 


United States 


2750-1019P 


00098.002 


09/660,883 


9/13/2000 
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United States 


2750-1020P 


00099.002 


09/663,196 


9/15/2000 


United States 


2750-1021P 


00101.002 


09/663,195 


9/15/2000 


United States 


2750-1022P 


00102.002 


09/665,714 


9/20/2000 


United States 


2750-1023P 


00103.002 


09/667,597 


9/22/2000 


United States 


2750-1025P 


00105.002 


09/667,229 


9/22/2000 


United States 


2750-1024P 


00104.002 


09/667,517 


9/22/2000 


United States 


2750-1026P 


00106.002 


09/671,635 


9/28/2000 


United States 


2750-1027P 


00107.002 


09/672,075 


9/29/2000 


United States 


2750-1028P 


00108.002 


09/679,203 


10/4/2000 


United States 


2750-1029P 


00109.002 


09/678,223 


10/5/2000 


United States 


2750-1030P 


00110.002 


09/680,499 


10/6/2000 


United States 


2750-103 IP 


00111.002 


09/680,490 


10/6/2000 


United States 


2750-1032P 


00112.002 


09/680,498 


10/6/2000 


United States 


2750-1033P 


00113.002 


09/686,093 


10/12/2000 


United States 


2750-1235P 


80145.003 


09/689,984 


10/13/2000 


United States 


2750-1240P 


80147.004 


09/689,983 


10/13/2000 


United States 


2750-1239P 


80147.003 


09/689,982 


10/13/2000 


United States 


2750-1242P 


80148.004 


09/688,051 


10/13/2000 


United States 


2750-1238P 


80146.004 


09/688,052 


10/13/2000 


United States 


2750-1237P 


80146.003 


09/689,980 


10/13/2000 


United States 


2750-1241P 


80148.003 


09/689,981 


10/13/2000 


United States 


2750-1236P 


80145.004 


09/688,050 


10/13/2000 


United States 


2750-1034P 


00116.002 


09/690,745 


10/18/2000 


United States 


2750-1301P 


00120.002 


09/692,108 


10/20/2000 


United States 


2750-1297P 


80149.004 


09/692,714 


10/20/2000 


United States 


2750-1303P 


80151.004 


09/692,696 


10/20/2000 


United States 


2750-1302P 


80151.003 


09/692,152 


10/20/2000 


United States 


2750-1295P 


00118.002 


09/692,157 


10/20/2000 


United States 


2750-1296P 


80149.003 


09/692,154 


10/20/2000 


United States 


2750-1 300P 


80150.004 


09/692,717 


10/20/2000 


United States 


2750-1298P 


00119.002 


09/692,153 


10/20/2000 


United States 


2750-1299P 


80150.003 


09/692,148 


10/20/2000 


United States 


2750-1308P 


00121.002 


09/695,391 


10/25/2000 


United States 


2750-1309P 


80152.003 


09/696,751 


10/25/2000 


United States 


2750-1310P 


80152.004 


09/695,387 


10/25/2000 


United States 


2750-13 12P 


80153.003 


09/696,284 


10/26/2000 


United States 


2750-1313P 


80153.004 


09/696,017 


10/26/2000 


United States 


2750-131 IP 


00122.002 


09/696,305 


10/26/2000 


United States 


2750-1317P 


00124.002 


09/697,718 


10/27/2000 


United States 


2750-13 19P 


80155.004 


09/697,076 


10/27/2000 


United States 


2750-13 18P 


80155.003 


09/697,081 


10/27/2000 


United States 


2750-1316P 


80154.004 


09/697,145 


10/27/2000 


United States 


2750-131 5P 


80154.003 


09/697,056 


10/27/2000 


United States 


2750-13 14P 


00123.002 


09/697,080 


10/27/2000 


United States 


2750-1330P 


80156.003 


09/702,841 


11/1/2000 


United States 


2750-1331P 


80156.004 


09/702,873 


11/1/2000 
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United States 


2750-1329P 


00125.002 


09/702,840 


11/1/2000 


United States 


2750-1332P 


00126.002 


09/703,932 


11/2/2000 


United States 


2750-1333P 


80157.003 


09/703,627 


11/2/2000 


United States 


2750-1334P 


80157.004 


09/703,619 


11/2/2000 


United States 


2750-1339P 


80159.003 


09/704,541 


11/3/2000 


United States 


2750-1338P 


00128.002 


09/704,542 


11/3/2000 


United States 


2750-1340P 


80159.004 


09/704,540 


11/3/2000 


United States 


2750-1335P 


00127.002 


09/704,559 


11/3/2000 


United States 


2750-1336P 


80158.003 


09/704,550 


11/3/2000 


United States 


2750-1337P 


80158.004 


09/704,836 


11/3/2000 


United States 


2750-1347P 


00129.002 


09/708,092 


11/8/2000 


United States 


2750-1243P 


80161.003 


09/708,427 


11/9/2000 


United States 


2750-1249P 


00143.002 


09/726,578 


12/1/2000 


United States 


2750-1250P 


80180.003 


09/731,809 


12/8/2000 


United States 


2750-1387P 


80182.003 


09/764,425 


1/19/2001 


United States 


2750-1 390P 


00153.002 


09/769,525 


1/26/2001 


United States 


2750-1401P 


00157.002 


09/774,806 


2/1/2001 


United States 


2750-1411P 


80194.002 


09/783,606 


2/15/2001 


United States 


2750-1467P 


80090.004 




8/24/2001 



The entire contents of the applications listed in the table above are expressly 
incorporated herein by reference. 

This application contains a CDR, the entire contents of which are hereby incorporated by 
reference. The CDR contains the following files: 



File Name 


Date Created 


File Size 


KNOCK-IN_01 


8/24/2001 


11.1 KB 


Knock-in _02 


8/24/2001 


6.78 KB 


knockout 


8/24/2001 


1.56 MB 


docket 80090 101 cdna map II delta 


8/24/2001 


1.20 KB 


Cluster Functions and Utilities 01 


8/24/2001 


12.3 KB 


Cluster Functions and Utilities 02 


8/24/2001 


88.8 KB 


Cluster Functions and Utilities 03 


8/24/2001 


6.10 KB 


Cluster Functions and Utilities 04 


8/24/2001 


6.14 KB 


Cluster Functions and Utilities 05 


8/24/2001 


36.4 KB 


Cluster Functions and Utilities 06 


8/24/2001 


94.2 KB 


Cluster Functions and Utilities 07 


8/24/2001 


8.24 KB 


Cluster Functions and Utilities 08 


8/24/2001 


16.6 KB 


Protein Domain Table 


8/24/2001 


2.62 MB 
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FIELD OF THE INVENTION 

The present invention relates to over 100,000** isolated polynucleotides that include 
a complete coding sequence, or a fragment thereof, that is expressed. These polynucleotides 
come from five plant species. In addition, the present invention relates to the polypeptide or 
protein corresponding to the coding sequence of these polynucleotides. The present 
invention also relates to isolated polynucleotides that represent regulatory regions of genes. 
The present invention also relates to isolated polynucleotides that represent untranslated 
regions of genes. The present invention further relates to the use of these isolated 
polynucleotides and polypeptides and proteins. 

BACKGROUND AND SUMMARY OF THE INVENTION 

There are more than 300,000 species of plants. They show a wide diversity of forms, 
ranging from delicate liverworts, adapted for life in a damp habitat, to cacti, capable of 
surviving in the desert. The plant kingdom includes herbaceous plants, such as corn, whose 
life cycle is measured in months, to the giant redwood tree, which can live for thousands of 
years. This diversity reflects the adaptations of plants to survive in a wide range of habitats. 
This is seen most clearly in the flowering plants (phylum Angiospermophyta), which are the 
most numerous, with over 250,000 species. They are also the most widespread, being found 
from the tropics to the arctic. 

The process of plant breeding involving man's intervention in natural breeding and 
selection, is some 20,000 years old. It has produced remarkable advances in adapting 
existing species to serve new purposes. The world's economics was largely based on the 
successes of agriculture for most of these 20,000 years. 

Plant breeding involves choosing parents, making crosses to allow recombination of 
gene (alleles) and searching for and selecting improved forms. Success depends on the 
genes/alleles available, the combinations required and the ability to create and find the correct 
combinations necessary to give the desired properties to the plant. Molecular genetics 
technologies are now capable of providing new genes, new alleles and the means of creating 
and selecting plants with the new, desired characteristics. 

When the molecular and genetic basis for different plant characteristics are 
understood, a wide variety of polynucleotides, both endogenous polynucleotides and created 
variants, polypeptides, cells, and whole organisms, can be exploited to engineer old and new 
plant traits in a vast range of organisms including plants. These traits can range from the 
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observable morphological characteristics, through adaptation to specific environments to 
biochemical composition and to molecules that the plants (organisms) exude. Such 
engineering can involve tailoring existing traits, such as increasing the production of taxol in 
yew trees, to combining traits from two different plants into a single organism, such as 
inserting the drought tolerance of a cactus into a corn plant. Molecular and genetic 
knowledge also allows the creation of new traits. For example, the production of chemicals 
and pharmaceuticals that are not native to particular species or the plant kingdom as a whole. 

The application reports the inventions Applicants have discovered to build a 
foundation of scientific understanding of plant genomes to achieve these aims. These 
inventions include polynucleotide and polypeptide sequences, and data relating to where and 
when the genes are differentially expressed and phenotypic observations resulting from either 
aberrant gene activation or disruption. How these data are transformed into a scientific 
understanding of plant biology and the control of traits from a genetic perspective also is 
explained by the instant application. Applications of these discoveries to create new 
prototypes and products in the field of chemical, pharmaceutical, food, feed, and fiber 
production are described herein as well. 

The achievements desribed in this application were possible because of the results 
from a cluster of technologies, a genomic engine, depicted below in Schematic 1, that allows 
information on each gene to be integrated to provide a more comprehensive understanding of 
gene structure and function and the deployment of genes and gene components to make new 
products. 



a) The discoveries of the instant application 
Applicants have isolated and identified over one hundred thousand genes, gene 
components and their products and thousands of promoters. The genes were isolated and/or 
characterized from arabidopsis, soybean, maize, wheat and rice. These species were selected 
because of their economic value and scientific importance and were deliberately chosen to 
include representatives of the evolutionary divergent dicotyledonous and monocotyledonous 
groups of the plant kingdom. The number of genes characterized in this application 
represents a large proportion of all the genes in these plant species. 
The techniques used initially to isolate and characterize most of the genes, namely 
sequencing of full-length cDNAs, were deliberately chosen to provide information on 
complete coding sequences and on the complete sequences of their protein products. 
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Gene components and products the Applicants have identified include exons, introns, 
promoters, coding sequences, antisense sequences, terminators and other regulatory 
sequences. The exons are characterized by the proteins they encode and arabidopsis 
promoters are characterized by their position in the genomic DNA relative to where mRNA 
5 synthesis begins and in what cells and to what extent they promote mRNA synthesis. 
Further exploitation of molecular genetics technologies have helped the Applicants to 
understand the functions and characteristics of each gene and their role in a plant. Three 
powerful molecular genetics approaches were used to this end: 

(a) Analyses of the phenotypic changes when the particular gene sequence is 
10 interrupted or activated differentially; (arabidopsis) 

(b) Analyses of in what plant organs, to what extent, and in response to what 
environmental signals mRNA is synthesized from the gene; (arabidopsis and 
maize) and 

O (c) Analysis of the gene sequence and its relatives, (all species) 

J These were conducted using the genomics engine depicted in Figure 1 that allows 

fU information on each gene to be integrated to provide a more comprehensive understanding of 
£ri gene structure and function and linkage to potential products. 

!L The species arabidopsis was used extensively in these studies for several reasons: (1) 

HO the complete genomic sequence, though poorly annotated in terms of gene recognition, was 

being produced and published by others and (2) genetic experiments to determine the role of 

the genes in planta are much quicker to complete. 

The phenotypic tables, MA tables, and reference tables and sequence tables indicate 

the results of these analyses and thus the specific functions and characteristics that are 
25 ascribed to the genes and gene components and products. 
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(1) Polypeptide Functional 
Analysis 



(2) Differential 
Expression (Mcroarrays) 




(5) Finding and 
Characterizing Promoters 






(3) Gene Disruption 
(Knock-out) 



(4) Ectopic Gene Expression 
(Knock-in) 




Application as Targets and Products 





Fiber 



(6) 



Schematic 1. Gene sequences were determined and are depicted to occupy 
the center of the figure. Five different sorts of technologies were deployed 
in the Genomics engine to discover the functions of the genes. (1) 
Computer-based comparisons of protein structural features. (2) Studies to 
discover where and when each gene and groups of genes are active. (3) 
Discovery of the phenotypic consequences of inactivating each gene. (4) 
Elucidation of the phenotypic consequences of activating a gene in a new 
way. (5) Discovery of the sequence and activity of promoters of the genes. 
All this information leads to knowledge of how to use the genes, and gene 
Sohipmiigtslto create new products for industrial applications. (6) 

Integration of discoveries to provide scientific understanding 

From the discoveries made, Applicants have deduced the biochemical activities, 
pathways, cellular roles, and developmental and physiological processes that can be 
modulated using these components. These are discussed and summarized in sections based 
on the gene functions characteristics from the analyses and role in determining phenotypes. 
These sections illustrate and emphasize that each gene, gene component or product influences 
biochemical activities, cells or organisms in complex ways, from which there can be many 
phenotypic consequences. 

An illustration of how the discoveries on gene structure, function, expression and 
phenotypic observation can be integrated together to understand complex phenotypes is 
provided in Figure 2. This sort of understanding enables conclusions to be made as to how 
the genes, gene components and product are useful for changing the properties of plants and 



Page 24 of 772 




Attorney DKt. Z/^U-lWif 
Client Dkt. 80090.005 



other organisms. This example also illustrates how single gene changes in, for example, a 
metabolic pathway can cause gross phenotypic changes. 

Schematic 2 



Genes A, B, C, & D 
on a c 




Proteins A, B, C, & D 



| Translation ^) ^^ ^ ^ 



Pathway Action 





„ (Dwarf as compared to wildtype) 



.Cell ular Response 



(Hormone Synthesis Pathway) 



Schematic 2. The figure illustrates how genes A, B, C andD are activated by 
internal stimuli and then their mRNA transcripts translated into proteins. These 
proteins are enzymes in three different but linked pathways. All three pathways are 
activated by the same stimuli One of them, depicted by the green and light blue 
proteins determines the levels of a hormone in the shoot meristems causes cells to 
expand. This cell expansion leads to a longer stem and a taller plant Genes A&C 
are therefore useful for controlling plant height and stem strength. The other two 



Furthermore, the development and properties of one part of plant can be 
interconnected with other parts. The dependence of shoot and leaf development on root cells 
is a classic example. Here, shoot growth and development require nutrients supplied from 
roots, so the protein complement of root cells can affect plant development, including flowers 
and seed production. Similarly, root development is dependent on the products of 
photosynthesis from leaves. Therefore, proteins in leaves can influence root developmental 
physiology and biochemistry. 

Thus, the Utility and Application sections describe both the functions and 
characteristics of the genes, gene components and products and also the multiplicity of 
biochemical activities, cellular functions, and the developmental and physiological processes 
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influenced by them. The sections also describe examples of commercial products that can be 
realized from the inventions. 



Analyses to Reveal Function and in vivo roles of single genes in one plant 
species. 

The genomics engine has focused on individual genes to reveal the multiple functions 
or characteristics that are associated to each gene, gene components and products of the 
instant invention in the living plant. For example, the biochemical activity of a protein is 
deduced based on its similarity to a protein of known function. In this .case, the protein may 
be ascribed with, for example, an oxidase activity. Where and when this same protein is 
active can be uncovered from differential expression experiments, which show that the 
mRNA encoding the protein is differentially expressed in response to drought and in seeds 
but not roots. The gene disruption experiments reveal that absence of the same protein causes 
embryo lethality. 

Thus, this protein is characterized as a seed protein and drought-responsive oxidase 
that is critical for embryo viability. 

Analyses to Reveal Function and roles of single genes in different species. 
The genomics engine has also been used to extrapolate knowledge from one species 

to many plant species. For example, proteins from different species, capable of performing 

identical or similar functions, preserve many features of amino acid sequence and structure 

during evolution. Complete protein sequences have been compared and contrasted within 

and between species to determine the functionally vital domains and signatures characteristic 

of each of the proteins that is the subject of this application. Thus, functions and 

characteristics of arabidopsis proteins have been extrapolated to proteins containing similar 

domains and signatures of corn, soybean, rice and wheat and by implication to all other 

(plant) species. 

Schematic 3 provides an example. Two proteins with related structures, one from 
corn, a monocot, and one from arabidopsis, a dicot, have been concluded to be orthologs. 
The known characteristics of the arabidopsis protein (seed protein, drought responsive 
oxidase) can then be attributed to the corn protein. 
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Schmatic 3. 



Integration of Data Across Species to link Gene Products and Phenotypes 



Arabidopsis 
Protein 



Similar sequence 
& 

Similar structure 



Maize 
Protein 





Thus 
Similar function 



> 





Arabidopsis 



Maize 



Analyses over multiple experiments to reveal gene networks and links across species 
The genomics engine can identify networks or pathways of genes concerned with the 
same process and hence linked to the same phenotype(s). Genes specifying functions of the 
same pathway or developmental environmental responses are frequently co-regulated i.e. they 
are regulated by mechanisms that result in coincident increases or decreases for all gene 
members in the group. The Applicants have divided the genes of arabidopsis and maize into 
such co-regulated groups on the basis of their expression patterns and the function of each 
group has been deduced. This process has provided considerable insight into the function 
and role of thousands of the plant genes in diverse species included in this application. 

Applications of Applicant's discoveries 

It will be appreciated while reading the sections that the different experimental 
molecular genetic approaches focused on different aspects of the pathway from gene and 
gene product through to the properties of tissues, organs and whole organisms growing in 
specific environments. For each endogenous gene, these pathways are delineated within the 
existing biology of the species. However, Applicants' inventions allow gene components or 
products to be mixed and matched to create new genes and placed in other cellular contexts 
and species, to exhibit new combinations of functions and characteristics not found in nature, 
or to enhance and modify existing ones. For instance, gene components can be used to 
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achieve expression of a specific protein in a new cell type to introduce new biochemical 
activities, cellular attributes or developmental and physiological processes. Such cell-specific 
targeting can be achieved by combining polynucleotides encoding proteins with any one of a 
large array of promoters to facilitate synthesis of proteins in a selective set of plant cells. 
This emphasizes that each gene, component and protein can be used to cause multiple and 
different phenotypic effects depending on the biological context. The utilities are therefore 
not limited to the existing in vivo roles of the genes, gene components, and gene products. 

While the genes, gene components and products disclosed herein can act alone, 
combinations are useful to modify or modulate different traits. Useful combinations include 
different polynucleotides and/or gene components or products that have (1) an effect in the 
same or similar developmental or biochemical pathways; (2) similar biological activities; (3) 
similar transcription profiles; or (4) similar physiological consequences. 

Of particular interest are the transcription factors and key factors in regulatory 
transduction pathways, which are able to control entire pathways, segments of pathways or 
large groups of functionally related genes. Therefore, manipulation of such proteins, alone or 
in combination is especially useful for altering phenotypes or biochemical activities in plants. 
Because interactions exist between hormone, nutrition, and developmental pathways, 
combinations of genes and/or gene products from these pathways also are useful to produce 
more complex changes. In addition to using polynucleotides having similar transcription 
profiles and/or biological activities, useful combinations include polynucleotides that may 
exhibit different transcription profiles but which participate in common or overlapping 
pathways. Also, polynucleotides encoding selected enzymes can be combined in novel ways 
in a plant to create new metabolic pathways and hence new metabolic products. 

The utilities of the various genes, gene components and products of the Application 
are described in the sections as follows: 

I. Organ Affecting Genes, Gene Components, Products (Including Differentiation Function) 
LA. Root Genes, Gene Components And Products 

I.A.I. Root Genes, Gene Components And Products 
I.A.2. Root Hair Genes, Gene Components And Products 

LB. Leaf Genes, Gene Components And Products 

I.B.I. Leaf Genes, Gene Components And Products 
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I.B.2. Trichome Genes And Gene Components 
LB. 3. Chloroplast Genes And Gene Components 



I.C. Reproduction Genes, Gene Components And Products 

I.C.I. Reproduction Genes, Gene Components And Products 

I.C.2. Ovule Genes, Gene Components And Products 

I.C.3. Seed And Fruit Development Genes, Gene Components And Products 

LD. Development Genes, Gene Components And Products 

I.D.l. Imbibition and Germination Responsive Genes, Gene Components And 
Products 

LD.2. Early Seedling Phase Genes, Gene Components And Products 
I.D.3. Size and Stature Genes, Gene Components And Products 
I.D.4. Shoot- Apical Meristem Genes, Gene Components And Products 
I.D.5. Vegetative-Phase Specific Responsive Genes, Gene Components And 



II. Hormones Responsive Genes, Gene Components And Products 

II.A. Abscissic Acid Responsive Genes, Gene Components And Products 
II.B. Auxin Responsive Genes, Gene Components And Products 
II.C. Brassinosteroid Responsive Genes, Gene Components And Products 
II.D. Cytokinin Responsive Genes, Gene Components And Products 

II. E. Gibberellic Acid Responsive Genes, Gene Components And Products 

EI. Metabolism Affecting Genes, Gene Components And Products 

m.A. Nitrogen Responsive Genes, Gene Components And Products 

III. B. Circadian Rhythm Responsive Genes, Gene Components And Products 

III.C. Blue Light (Phototropism) Responsive Genes, Gene Components And Products 

IH.D. Co2 Responsive Genes, Gene Components And Products 

HI.E. Mitochondria Electron Transport Genes, Gene Components And Products 

m.F. Protein Degradation Genes, Gene Components And Products 

HLG. Carotenogenesis Responsive Genes, Gene Components And Products 



Products 
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IV. Viability Genes, Gene Components And Products 

IV.A. Viability Genes, Gene Components And Products 

IV. B. Histone Deacetylase (Axel) Responsive Genes, Gene Components And 
Products 

V. Stress Responsive Genes, Gene Components And Products 

V. A. Cold Responsive Genes, Gene Components And Products 
V.B. Heat Responsive Genes, Gene Components And Products 
V.C. Drought Responsive Genes, Gene Components And Products 
V.D. Wounding Responsive Genes, Gene Components And Products 

V.E. Methyl Jasmonate Responsive Genes, Gene Components And Products 

V.F. Reactive Oxygen Responsive Genes, Gene Components And H202 Products 

V.G. Salicylic Acid Responsive Genes, Gene Components And Products 

V.H. Nitric Oxide Responsive Genes, Gene Components And Products 

V.I. Osmotic Stress Responsive Genes, Gene Components And Products 

VJ. Aluminum Responsive Genes, Gene Components And Products 

V.K. Cadmium Responsive Genes, Gene Components And Products 

V.L. Disease Responsive Genes, Gene Components And Products 

V.M. Defense Responsive Genes, Gene Components And Products 

V.N. Iron Responsive Genes, Gene Components And Products 

V.O. Shade Responsive Genes, Gene Components And Products 

V.P. Sulfur Responsive Genes, Gene Components And Products 

V.Q. Zinc Responsive Genes, Gene Components And Products 

VI. Enhanced Foods 

VTL Pharmaceutical Products 

VTQ. Precursors Of Industrial Scale Compounds 

IX. Promoters As Sentinels 
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The present invention comprises polynucleotides, such as complete cDNA sequences 
and/or sequences of genomic DNA encompassing complete genes, fragments of genes, and/or 
5 regulatory elements of genes and/or regions with other functions and/or intergenic regions, 
hereinafter collectively referred to as Sequence-Determined DNA Fragments (SDFs) or 
sometimes collectively referred to as "genes or gene components", or sometimes as "genes, gene 
components or products", from different plant species, particularly corn, wheat, soybean, rice 
and Arabidopsis thaliana, and other plants and or mutants, variants, fragments or fusions of said 

10 SDFs and polypeptides or proteins derived therefrom. In some instances, the SDFs span the 
entirety of a protein-coding segment. In some instances, the entirety of an mRNA is 
represented. Other objects of the invention that are also represented by SDFs of the invention 
are control sequences, such as, but not limited to, promoters. Complements of any sequence of 

O the invention are also considered part of the invention. 

,J;5 Other objects of the invention are polynucleotides comprising exon sequences, 

% polynucleotides comprising intron sequences, polynucleotides comprising introns together with 

fU exons, intron/exon junction sequences, 5' untranslated sequences, and 3' untranslated sequences 

m of the SDFs of the present invention. Polynucleotides representing the joinder of any exons 

L described herein, in any arrangement, for example, to produce a sequence encoding any 

HO desirable amino acid sequence are within the scope of the invention. 

j: The present invention also resides in probes useful for isolating and identifying nucleic 

H acids that hybridize to an SDF of the invention. The probes can be of any length, but more 

typically are 12-2000 nucleotides in length; more typically, 15 to 200 nucleotides long; even 

more typically, 18 to 100 nucleotides long. 
25 Yet another object of the invention is a method of isolating and/or identifying nucleic 

acids using the following steps: 

(a) contacting a probe of the instant invention with a polynucleotide sample under 
conditions that permit hybridization and formation of a polynucleotide duplex; and 

(b) detecting and/or isolating the duplex of step (a). 

30 The conditions for hybridization can be from low to moderate to high stringency 

conditions. The sample can include a polynucleotide having a sequence unique in a plant 
genome. Probes and methods of the invention are useful, for example, without limitation, for 
mapping of genetic traits and/or for positional cloning of a desired fragment of genomic DNA. 
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Probes and methods of the invention can also be used for detecting alternatively spliced 
messages within a species. Probes and methods of the invention can further be used to detect or 
isolate related genes in other plant species using genomic DNA (gDNA) and/or cDNA libraries. 
In some instances, especially when longer probes and low to moderate stringency hybridization 
conditions are used; the probe will hybridize to a plurality of cDNA and/or gDNA sequences of 
a plant. This approach is useful for isolating representatives of gene families which are 
identifiable by possession of a common functional domain in the gene product or which have 
common cis-acting regulatory sequences. This approach is also useful for identifying 
orthologous genes from other organisms. 

The present invention also resides in constructs for modulating the expression of the 
genes comprised of all or a fragment of an SDF. The constructs comprise all or a fragment of 
the expressed SDF, or of a complementary sequence. Examples of constructs include 
ribozymes comprising RNA encoded by an SDF or by a sequence complementary thereto, 
antisense constructs, constructs comprising coding regions or parts thereof, constructs 
comprising promoters, introns, untranslated regions, scaffold attachment regions, methylating 
regions, enhancing or reducing regions, DNA and chromatin conformation modifying 
sequences, etc. Such constructs can be constructed using viral, plasmid, bacterial artificial 
chromosomes (BACs), plasmid artificial chromosomes (PACs), autonomous plant plasmids, 
plant artificial chromosomes or other types of vectors and exist in the plant as autonomous 
replicating sequences or as DNA integrated into the genome. When inserted into a host cell 
the construct is, preferably, functionally integrated with, or operatively linked to, a 
heterologous polynucleotide. For instance, a coding region from an SDF might be operably 
linked to a promoter that is functional in a plant. 

The present invention also resides in host cells, including bacterial or yeast cells or plant 
cells, and plants that harbor constructs such as described above. Another aspect of the invention 
relates to methods for modulating expression of specific genes in plants by expression of the 
coding sequence of the constructs, by regulation of expression of one or more endogenous genes 
in a plant or by suppression of expression of the polynucleotides of the invention in a plant. 
Methods of modulation of gene expression include without limitation (1) inserting into a host 
cell additional copies of a polynucleotide comprising a coding sequence; (2) modulating an 
endogenous promoter in a host cell; (3) inserting antisense or ribozyme constructs into a host 
cell and (4) inserting into a host cell a polynucleotide comprising a sequence encoding a variant 
, fragment, or fusion of the native polypeptides of the instant invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

BRIEF DESCRIPTION OF THE INVENTIONS 

As noted above, the Applicants have obtained and analyzed an extensive amount of 
information on a large number of genes by use of the Ceres Genomic Engine to determine. 
This information can be categorized into three basic types: 

I. Sequence Information for the Inventions 

H^/^ Information for the 

EI. Phenotypic Information for the Inventions 

I. Sequence Information 



To harness the potential of the plant genome, Applicants began by elucidating a large 
number gene sequences, including the sequences of gene components and products, and 
analyzing the data. The list of sequences and associated data are presented in the Reference 
and Sequence Tables of the present application (sometimes referred to as the "REF" and 
"SEQ" Tables). The Reference and Sequence tables include: 



■ cDNA sequence; 

■ coding sequence; 

■ 5'&3'UTR; 

■ transcription start sites; 

■ exon and intron boundaries in genomic sequence; and 

■ protein sequence. 



The Reference and Sequence Tables also include computer-based, comparative 
analyses between the protein sequences of the invention and sequences with known function. 
Proteins with similar sequences typically exhibit similar biochemical activities. The 
Reference table notes: 



■ sequences of known function that are similar to the Applicants' proteins; 
and 

■ biochemical activity that is associated with Applicants' proteins. 



Also, by analyzing the protein sequences, Applicants were able to group the protein 
sequences into groups, wherein all the sequences in the group contain a signature sequence. 
The groups are presented in the Protein Group Table. The signature sequences are reported 



Introduction to Sequence Information 
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in the Protein Group Table. More detailed analyses of the signature sequences are shown in 
the Protein Group Matrix Table. 



To generate the Reference and Sequence Tables, Applicants took a cDNA/coding 
sequence approach. That is, Applicants initiated their studies either by isolating cDNAs and 
determining their sequences experimentally, or by identifying the coding sequence from 
genomic sequence with the aid of predictive algorithms. The cDNA sequences and coding 
sequences also are referred to as "Maximum Length Sequences" in the Reference tables. The 
cDNA and coding sequences were given this designation to indicate these were the maximum 
length of coding sequences identified by Applicants. 

Due to this cDNA/coding sequence focus of the present application, the Reference 
and Sequence Tables were organized around cDNA and coding sequences. Each of these 
Maximum Length Sequences was assigned a unique identifier: Ceres Sequence ID NO, 
which is reported in the Tables. 

All data that relate to these Maximum Length Sequences are grouped together, 
including 5' & 3' UTRs; transcription start sites; exon and intron boundaries in genomic 
sequence; protein sequence, etc. 

Below, a more detailed explanation of the organization of the Reference and 
Sequence Tables and how the data in the tables were generated is provided. 

a. cDNA 

Applicants have ascertained the sequences of mRNAs from different organisms by 
reverse transcription of mRNA to DNA, which was cloned and then sequenced. These 
complementary DNA or cDNA sequences also are referred to as Maximum Length 
Sequences in the Reference Tables, which contain details on each of the sequences in the 
Sequence Tables. 

Each sequence was assigned a Pat. Appln. Sequence ID NO: and an internal Ceres 
Sequence ID NO: as reported in the Reference Table, the section labeled "(Ac) cDNA 
Sequence." An example is shown below: 



IA. 



IDENTIFICATION OF GENE COMPONENTS AND PRODUCTS 



Max Len. Seq. : 
(Ac) cDNA Sequence 
- Pat. Appln. Sequence ID NO: 174538 
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Both numbers are included in the Sequence Table to aid in tracking of information, as 
shown below: 

5 <210> 174538 (Pat. Appln. Sequence ID NO:) 

<211> 1846 
<212> DNA (genomic) 
<213> Arabidopsis thaliana 

10 <220> 

<221> misc_feature 

<222>(1)..(1846) 

<223> Ceres Seq. ID no. 5673127 

J5 <220> 

S <221> miscfeature 

fU <222> 0-0 

m <223> n is a, c, t, g, unknown, or other 

i | 

@0 <400> 174538 

I iJ 

jz acaagaacaa caaaacagag gaagaagaag aagaagatga agcttctggc tctgtttcca 60 

tttctagcga tcgtgatcca actcagctgt. . . etc. 

25 The Sequence and Reference Tables are divided into sections by organism: 

Arabidopsis thaliana, Brassica napus, Glycine max, Zea mays, Triticum aestivum; and Oryza 
sativa. 

b. Coding Sequence 
30 The coding sequence portion of the cDNA was identified by using computer-based 

algorithms and comparative biology. The sequence of each coding sequence of the cDNA is 
reported in the "PolyP Sequence" section of the Reference Tables, which are also divided into 
sections by organism. An example shown below for the peptides that relate to the cDNA 
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sequence above 

PolyP Sequence 

- Pat. Appln. Sequence ID NO 174539 

- Ceres Sequence ID NO 5673128 

- Loc. Sequence ID NO 174538: @ 1 nt. 

- Loc. Sig. P. Sequence ID NO 174539: @ 37 aa. 

The polypeptide sequence can be found in the Sequence Tables by either the Pat. Appln. 
Sequence ID NO or by the Ceres Sequence ID NO: as shown below: 

<210> 174539 (Pat. Appln. Sequence ID NO) 

<211>443 

<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> peptide 
<222>(1)..(443) 

<223> Ceres Seq. ID no. 5673128 

<220> 

<221> misc_feature 
<222>0-.0 

<223> xaa is any aa, unknown or other 
<400> 174539 

Thr Arg Thr Thr Lys Gin Arg Lys Lys Lys Lys Lys Met Lys Leu Leu 
15 10 15 

Ala Leu Phe Pro Phe Leu Ala He. . . etc. 
25 
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The PolyP section also indicates where the coding region begins in the Maximum 
Length Sequence. More than one coding region may be indicated for a single polypeptide due to 
multiple potential translation start codons. Coding sequences were identified also by analyzing 
genomic sequence by predictive algorithms, without the actual cloning of a cDNA molecule 
from a mRNA. By default, the cDNA sequence was considered the same as the coding 
sequence, when Maximum Length Sequence was a spliced together from a genomic annotation. 

c. 5' and 3' UTR 
The 5' UTR can be identified as any sequence 5' of the initiating codon of the coding 
sequence in the cDNA sequence. Similarly, the 3' UTR is any sequence 3' of the terminating 
codon of the coding sequence. 



d. Transcription Start Sites 
Applicants cloned a number of cDNAs that encompassed the same coding sequence but 
comprised 5' UTRs of different lengths. These different lengths revealed the multiple 
transcription start sites of the gene that corresponded to the cDNA. These multiple transcription 
start sites are reported in the "Sequence # w. TSS" section" of the Reference Tables. 



e. Exons & Introns 
Alignment of the cDNA sequences and coding portions to genomic sequence permitted 
Applicants to pinpoint the exon/intron boundaries. These boundaries are identified in the 
Reference Table under the "Pub gDNA" section. That section reports the gi number of the 
public B AC sequence that contains the introns and exons of interest. 
Max Len. Seq. : 
Pub gDNA: 

giNo: 1000000005 
Gen. seq. in cDNA: 



115777 .. 


. 115448 by Method #1 


115105 .. 


. 114911 by Method #1 


114822 .. 


. 114700 by Method #1 


114588 .. 


. 114386 by Method #1 


114295 .. 


. 113851 by Method #1 


115777.. 


. 11 5448 by Method #2 
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115105 .. 


. 114911 by Method #2 


114822 .. 


. 114700 by Method #2 


114588 .. 


. 114386 by Method #2 


114295 .. 


. 113851 by Method #2 


115813 .. 


. 115448 by Method #3 


115105 .. 


. 114911 by Method #3 


114822 .. 


. 114700 by Method #3 


114588 .. 


. 114386 by Method #3 


114295 .. 


. 113337 by Method #3 



(Ac) cDNA Sequence 

All the gi numbers were assigned by Genbank to track the public genomic sequences 

except: 

gi 1000000001 
gi 1000000002 
gi 1000000003 
gi 1000000004; and 
gi 1000000005. 

These gi numbers were assigned by Applicants to the five Arabidopsis chromosome 
sequences that were published by the Institute of Genome Research (TIGR). Gi 1000000001 
corresponds to chromosome 1, Gi 1000000002 to chromosome 2, etc. 

The method of annotation is indicated as well as any similar public annotations. 



Promoter sequences are 5' of the translational start site in a gene; more typically, 5' of the 
transcriptional start site or sites. Terminator sequences are 3' of the translational terminator 
codon; more typically, 3' of the end of the 3' UTR. 

For even more specifics of the Reference and Sequence Tables, see the section below titled 
"Brief Description of the Tables " 



f. 



Promoters & Terminators 



n. TRANSCRIPTIONAL (DIFFERENTIAL EXPRESSION) INFORMATION 
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Introduction to Differential Expression Data & Analyses 
A major way that a cell controls its response to internal or external stimuli is by 
regulating the rate of transcription of specific genes. For example, the differentiation of cells 
during organogenesis into forms characteristic of the organ is associated with the selective 
activation and repression of large numbers of genes. Thus, specific organs, tissues and cells are 
functionally distinct due to the different populations of mRNAs and protein products they 
possess. Internal signals program the selective activation and repression programs. For 
example, internally synthesized hormones produce such signals. The level of hormone can be 
raised by increasing the level of transcription of genes encoding proteins concerned with 
hormone synthesis. 

To measure how a cell reacts to internal and/or external stimuli, individual mRNA levels 
can be measured and used as an indicator for the extent of transcription of the gene. Cells can be 
exposed to a stimulus, and mRNA can be isolated and assayed at different time points after 
stimulation. The mRNA from the stimulated cells can be compared to control cells that were not 
stimulated. The mRNA levels of particular Maxiumum Length Sequences that are higher in the 
stimulated cell versus the control indicate a stimulus-specific response of the cell. The same is 
true of mRNA levels that are lower in stimulated cells versus the control condition. 
Similar studies can be performed with cells taken from an organism with a defined mutation in 
their genome as compared with cells without the mutation. Altered mRNA levels in the mutated 
cells indicate how the mutation causes transcriptional changes. These transcriptional changes 
are associated with the phenotype that the mutated cells exhibit that is different from the 
phenotype exhibited by the control cells. 

Applicants have utilized microarray techniques to measure the levels of mRNAs in cells 
from mutant plants, stimulated plants, and/or selected from specific organs. The differential 
expression of various genes in the samples versus controls are listed in the MA_diff Tables. 
Applicants have analyzed the differential data to identify genes whose mRNA transcription 
levels are positively correlated. From these analyses, Applicants were able to group different 
genes together whose transcription patterns are correlated. The results of the analyses are 
reported in the MA_clust Tables. 

a. Experimental Detail 
A microarray is a small solid support, usually the size of a microscope slide, onto which 
a number of polynucleotides have been spotted onto or synthesized in distinct positions on the 
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slide (also referred to as a chip). Typically, the polynucleotides are spotted in a grid formation. 
The polynucleotides can either be Maximum Length Sequences or shorter synthetic 
oligonucleotides, whose sequence is complementary to specific Maximum Length Sequence 
entities. A typical chip format is as follows: 



10 



20 



01igo#l 


01igo#2 


Oligo #3 


01igo#4 


Oligo #5 


Oligo #6 


Oligo #7 


Oligo #8 


Oligo #9 



For Applicants' experiments, samples were hybridized to the chips using the "two-color" 
microarray procedure. A fluorescent dye was used to label cDNA reverse-transcribed from 
mRNA isolated from cells that had been stimulated, mutated, or collected from a specific organ 
or developmental stage. A second fluorescent dye of another color was used to label cDNA 
prepared from control cells. 

The two differentially-labeled cDNAs were mixed together. Microarray chips were 
incubated with this mixture. For Applicants' experiments the two dyes that are used are Cy3, 
which fluoresces in the red color range, and Cy5, which fluoresces in the green/blue color range. 
Thus, if: 

cDNA#l binds to Oligo #1; 

cDNA#l from the sample is labeled red; 

cDNA#l from the control is labeled green, and 

cDNA#l is in both the sample and control, 

then cDNA#l from both the sample and control will bind to 01igo#l on the chip. If the sample 
has 10 times more cDNA#l than the control, then 10 times more of the cDNA#l would be 
hybridized to 01igo#l. Thus, the spot on the chip with 01igo#l spot would look red. 



25 





Oligo #2 


Oligo #3 


Oligo #4 


Oligo #5 


Oligo #6 


Oligo #7 


Oligo #8 


Oligo #9 



If the situation were reversed, the spot would appear green. If the sample has approximately the 
same amount of cDNA#l as the control, then the 01igo#l spot on the chip would look yellow. 
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These color differentials are measured quantitatively and used to deduce the relative 
concentration of mRNAs from individual genes in particular samples. 



b. 



MA Diff Data Table 



To generate data, Applicants labeled and hybridized the sample and control mRNA in duplicate 
experiments. One chip was exposed to a mixture of cDNAs from both a sample and control, 
where the sample cDNA was labeled with Cy3, and the control was labeled with Cy5 dye. For 
the second labeling and chip hybridization experiments, the fluorescent labels were reversed; 
that is, the Cy5 dye for the sample, and the Cy3 dye for the control 

Whether Cy5 or Cy3 was used to label the sample, the fluorescence produced by the 
sample was divided by the fluorescence of the control. A cDNA was determined to be 
differentially expressed in response to the stimulus in question if a statistically-significantly 
ration difference in the sample versus the control was measured by both chip hybridization 
experiments. 

The MA_diff tables show which cDNA were significantly up-regulated as designated by 
a "+" and which were significantly down-regulated as designated by a "-" for each pair of chips 
using the same sample and control. 

Ill PHENOTYPIC INFORMATION 

One means of determining the phenotypic effect of a gene is either to insert extra active 
copies of the gene or coding sequence, or to disrupt an existing copy of the gene in a cell or 
organism and measure the effects of the genetic change on one or more phenotypic characters or 
traits. "Knock-in" is used herein to refer to insertion of additional active copies of a gene or 
coding sequence. "Knock-out" refers to a plant where an endogenous gene(s) is disrupted. 
Applicants have used both methods of addition or disruption to determine the phenotypic effects 
of gene or gene components or products, and have thereby discovered the function of the genes 
and their utilities. 

1. Knock-in results 
The coding sequence of a desired protein can be functionally linked to a heterologous promoter 
to facilitate expression. Here, Applicants have operably linked a number of coding sequences to 
either one of the promoters listed below: 
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GFP Pattern 


Specific Promoter 


Plant Line 




activity 


Descriptor 


Root epidermis / mostly toward the lower 
region of root (more intense than CS9094) 


Specific to the root basal 
region. 


Root basal 


Root-endodermis/cortex (initials sharp); 
shoot-mesophyll of one leaf, sharp guard cell 
marking. New leaf petioles near tip of 
primary inflorescence; floral stems; in 
flowers at base of sepal, anther stems, and 
pistil 


Specific to the root 
endodermis-cortex 
region, leaf petiole, and 
flowers. 


Root/Petiole/Flowers 


Broad root exp. (some dermal, some cortical, 
some vascular); shoot apex. Faintly in 
petiole; stem 


Specific to root and stem. 


Root/Steml 


High expression in stem, excluded from 1st 
true leaves/High in root. Faint expression in 
stem 


Specific to stem and root. 


Root/Stem2 


Shoot meristem / whole root region; little bit 
on cotyledons. Base of leaves(axillary 
meristem?); base of sepals; inflorescence 
meristem; small amount in unfertilized pistil. 


Specific to roots, shoot 
meristem, base of leaves 
and flowers. 


Root/Stem/Leaves/Fl 
owers 


root tip vascular initials; vascular system 
throughout plant; Bud petal vasculature and 
pistil septum; Flower petal vascualture; 
Flower pistil septum; Pre fertilization ovules; 
Post fertilization ovule at chalazal end; 
Developing seed (young, maturing siliques); 
Seed coat and young embryos. GFP not 
observed in mature embryos. 


Specific to vascular 
systems. 


Vascular/Ovule/Y ou 
ng Seed/Embryo 


Flower, sepal / vascular tissue of root, stem, 
and cotyledons. Stems of new flowers; 
vasculature or petals, anthers, sepals, and 
pistil/silique; Vasculature throughtout 
seedling: root, hypocotyl, petioles, stem, 
cotyledons, first true leaves; Rosette 
vasculature; Cauline leaf vasculature; Bud 
pedicel vasculature; Flower vasculature: 
(sepals, petals, filaments, pistil); Bud 
vasculature (sepal,petal, filament, pistil); 
Funiculus in both flower and bud; Some 
possible seed coat expression; Silique 
funiculus; Very faint fluorescence in mature 
embryo (auto fluorescence perhaps); 


Specific to flowers, seed 
and vasculature. 


Flowers/Seed/Vascul 
ature/Embryo 


Ro6t expression - primarily in cortex (upper 
refion of the root). No shoot expression 


Specific to root. 


Roots2 


Root expression - less intense in whole root 
of young seedling. Shoot apical meristem; 
organ primordia in SAM region. 


Specific to root and shoot 
apical meristem. 


Root/SAM 
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GFP Pattern 


Specific Promoter 


Plant Line 




activity 


Descriptor 


Root epidermis/tip; shoot epidermis/vascular; 
leaf epidermis; expression in developing 
seed/ovule - mature embryo; Primary and 
lateral root cortex; Very strong in root cap; 
Base of flower bud and epidermis of carpels; 
Base of flower, epidermis of filaments, 
epidermis of carpels; Trichomes; Weak 
(hardly detectable) gfp expression in 
vasculature throughout seedling; Strong 
expression in trichomes; POST- fertilization 
SEED only; GFP strength increases as 
silique matures; Weak at suspensor end of 
the embryo; GFP observed in seed coat; Root 
and post fertilization seed specific gfp 
expression; Expression in seed coat. 


Specific to seed and to 
epidermal layers of roots, 
shoots and leaves. 


Seed/Epidermis/Ova 
ry/Fruit 


Young root dermis; dermal/cortical?/vascular 
in older root;general (epidermal?) shoot 
expression; ovules, some in sepals; 
vasculature of stem 


Specific to roots, shoots, 
and ovules. 


Roots/Shoots/Ovule 


Vascular tissue of root; Meristem tissues: 
axillary meristems, floral meristems, base of 
flowers/sepals; Weak expression in 
hypocotyl, petiole and cotyledon 
vasculature.. 


Specific to root structural 
leaf vascular region and 
to floral buds and axillary 
meristem 


Vasculature/Meriste 
m 



The chimeric constructs were transformed into Arabidopsis thaliana. The resulting transformed 
lines were screened to determine what phenotypes were changed due to introduced transgene. 
The phenotype changes, relative to the control, are reported in the Knock-in tables. 

2. Knock-out Results 

Knock-out plants in Arabidopsis thaliana were created by inserting a polynucleotide tag into 
the genome. The location of the tag was identified using primers to the tag sequence and isolation of 
the plant genomic sequence that flanks the tag using a variation of the polymerase chain reaction. The 
plants were generated using the procedure described in Feldmann et al., (1987) Molec. Gen. Genet. 
208: 1-9; Feldmann (1991) Plant Journal, 1:71-83 and Forsthoefel et al., (1992) Aust. J. Plant Physiol. 
19:353-366.. On average, the population of plants that was screened had -1.5 to 2 tags. Generally, the 
number of tags ranged from 1 to greater than 5. 

The polynucleotide tags were classified as either incorporated within a gene, or between 
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two genes. The data in the Knock-out Table indicates which plants have a tag(s) causing a 
disruption in a gene, or a disruption between genes. 

a. Disruption in a Gene 
For the sake of this analysis, the tag was considered to be causing a disruption in a 
gene when the tag was located: 

1) less than 501 upstream of the transcriptional start site; 

2) less than 701 upstream of the translational initiation codon; 

3) between the translational initiation and termination codons of the gene, 

4) less than 301 downstream of the translational stop codon; or 

5) less than 151 downstream of a transcriptional termination site. 



By this definition, a tag can be inserted in two genes. For example, if two genes have 
only 700 nucleotides between the translational termination codon of one gene and the 
translational initiation codon of the other gene, the tag can be inserted into the terminator of 
one gene and the promoter of the other gene according to the definition above. 

Genomic annotations by the method OCKHAM-OCDNA identify the transcriptional 
start and stop site of a gene. 

b. Disruption between Genes 

When a tag causes a disruption between two genes, either or both genes can be 
affected. Typically, a tag can affect a gene if it disrupts the genome at a location 3000 nt 
downstream to the start codon of a gene. More typically, insertions found 1000- 2000 nt 
upstream (5'), or 750-1000 nt downstream (3') could be expected to disrupt expression. 

c. More Than One Insert 

A plant can have multiple tags. If a mutant phenotype is observed, then it can be attributed to 
any one or all of the tags. 
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HOW THE INVENTIONS REVEAL HOW GENES, GENE COMPONENTS AND 
PRODUCTS FUNCTION 

The different experimental molecular genetic approaches focused on different aspects of 
genes, gene components, and gene products of the inventions. The variety of the data 
demonstrates the multiple functions and characteristics of single genes, gene components, and 
products. The data also explain the pathways and networks in which individual genes and 
products participate and interact. As a result, the circumstances or conditions are now known 
when these genes and networks are active. These new understandings of biology are relevant 
for many plant species. The following section describes the process by which Applicants 
analyzed the inventions generated by the Ceres Genomic Engine: 

TABLE OF CONTENTS 

I. Experimental Results Reveal Many Facets Of A Single Gene 

LA. Intro: Functions And Characteristics Of Gene Components and Products Are 
Identified 

LA. 1 . Functions Of Coding Sequences Revealed By The Ceres Genomic 
Engine 

A. 1 .a. Sequence Similarity To Proteins Of Known Function Can Be 



Used To Associate Biochemical Activities And Molecular 



Interaction To The Proteins Of The Invention 



1 . Presence Of Amino Acid Motifs Indicates Biological 



Function 



2. Related Amino Acid Sequences Share Similar 
Biological Functions 



A. Lb. 



Differential Expression Results Explain In Which Cellular 



Responses The Proteins Of The Invention Are Involved 

1. The Transcript Levels Of A Protein Over Time In 
Response To A Stimuli Are Revealed By Transcriptional 
Analyses Over Many Experiments. 

2. The Transcript Levels Of A Protein Over Different 
Developmental Stages Can Be Identified By Transcriptional 
Analyses Over Many Experiments 
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3 . Proteins That Are Common In A Number Of Similar 
Responses Can Be Identified By Transcriptional Analyses 
Over A Number Of Experiments 

4. Proteins That Are Common To Disparate Responses 
Can Be Identified By Transcriptional Analyses Over A 
Number Of Experiments 



A. 1 .c. Observations Of Phenotypic Changes Show What 

Physiological Consequences Applicants' Proteins Can Produce 

LA.2. Specificity Of Regulatory Sequence Are Uncovered By The Ceres 
Genomic Engine 

A.2.a. Differential Expression Results Explain Which External Or 
Internal Stimuli Trigger The Regulatory Sequences 

Experimental Results Also Reveal Pathways Or Networks Of Genes 
II.A. Genes Whose Transcription Are Well Coordinated Generally Act Together To 
Produce Proteins That Participate In The Same Pathway Or Network 
II.A.l. Calculating The Correlation Coefficient Between Pairs Of Genes 

Based On The Differential Expression Data 
ILA.2. The Complete Linkage Analyses of Differential Identity Genes With 

Similar Pattern Of Transcription 
n.A.3. The Nearest Neighbor Analyses of Differential Group Genes With 



Experimental Results Reveal The Function And Characteristic Genes, Pathways And 
Networks 

m. A. Linking Biochemical Or Metabolic Activities Of One Protein In A Cluster To 

The Other Proteins In The Same Microarray Cluster 
m.B. Using Differential Expression Data To Determine When The Genes And 

Pathways Are Active 

m.C. Using Phenotype Data To Determine When Genes And Pathways Are Active 



Correlated But Dissimilar Transcription Profiles 
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IV. Experimental Results Provide An Understanding Of Genes, Pathways and Networks 
In Many Plant Species 

IV. A. Intro: Using Protein Sequence Similarity To Assign Function From 



Arabidopsis Proteins To Proteins Of Other Species 

IV.A.l. Conserved Sequences Between Proteins Of Different Species Give 

Rise To A Signature Sequence 
IV. A.2. Linking Signature Sequences To Conservation Of Structural 

Characteristics 

IV.A.3. Linking Signature Sequences To Conservation Of Biochemical 

Activities And Molecular Interactions 
IV.A.4. Linking Signature Sequences To Conservation Of Cellular 

Responses Or Activities 
IV.A.5. Linking Signature Sequences To Conservation Of Phenotypic Effects 



I. EXPERIMENTAL RESULTS REVEAL MANY FACETS OF A SINGLE GENE 

LA. INTRO: FUNCTIONS AND CHARACTERISTICS OF GENE 
COMPONENTS AND PRODUCTS ARE IDENTIFIED 

The experimental results are used to dissect the function of individual components and 
products of the genes. For example, the biochemical activity of the encoded protein could be 
surmised from sequence analyses, and promoter specificity could be identified through 
transcriptional analyses. Generally, the data presented herein can be used to functionally 
annotate either the protein sequence and/or the regulatory sequence that control transcription and 
translation. 



LA. 1 . FUNCTIONS OF CODING SEQUENCES REVEALED BY THE 
CERES GENOMIC ENGINE 

A. 1 .a. SEQUENCE SIMILARITY TO PROTEINS OF KNOWN 
FUNCTION CAN BE USED TO ASSOCIATE 
BIOCHEMICAL ACTIVITIES AND MOLECULAR 
INTERACTION TO THE PROTEINS OF THE INVENTION 



Page 47 of 772 



Attorney Diet 2730-148.51' 
Client Dkt. 80090.005 



The protein sequences of the invention were analyzed to determine if they shared any 
sequence characteristics with proteins of known activity. Proteins can be grouped together 
based on sequence similarity, either localized or throughout the length of the proteins. 
Typically, such groups of proteins exhibit common biochemical activities or interact with 
similar molecules. 



Localized protein sequence similarity, also referred to as amino acid motifs, have been 
attributed to enzyme or protein functions. A library of motifs, important for function, have been 
documented in PROSITE, a public database available at http://www.expasy.ch/prosite/. This 
library includes descriptions of the motifs and their functions. The zinc finger motif is one such 
entry in PROSITE, which reports that the zinc finger domain of DNA-binding proteins is 
typically defined by a 25-30 amino acid motif containing specific cysteine or histidine residues 
that are involved in the tetrahedral coordination of a zinc ion. Any protein comprising a 
sequence similar to the zinc finger amino acid motif will have similar functional activity 
(specific binding of DNA). 

Protein sequences of the invention have been compared to a library of amino acid motifs 
in the pFAM database, which is linked to the PROSITE database. If any of Applicants' protein 
sequences exhibit similarity to these amino acid motifs or domains, the Reference Table notes 
the name and location of the motif in the "Pred. PP Nom. & Annot" section of the Reference 
tables. A description of any biochemical activities that are associated to these domains, and 
therefore associated with Applicants' proteins, is included in the Protein Domain table. 

For example, polypeptide, CERES Sequence ID NO: 1545823 is associated with zinc 
finger motif as follows in the Reference Table: 



It is apparent, when studying protein sequence families, that some regions have been 



1. PRESENCE OF AMINO ACID MOTIFS INDICATES 



BIOLOGICAL FUNCTION 



(C) Pred. PP Nom. & Annot. 

- Zinc finger, C3HC4 type (RING finger) 

- Loc. Sequence ID NO 133059: 58 -> 106 aa. 



2. RELATED AMINO ACID SEQUENCES SHARE 
SIMILAR BIOLOGICAL FUNCTIONS 
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better conserved than others during evolution. These regions are generally important for the 
function of a protein and/or for the maintenance of its three-dimensional structure. 

The Reference Table reports in section "(Dp) Rel. AA Sequence" when a protein shares 
amino acid similarity with a protein of known activity. The section reports the gi number of the 
protein of known activity, a brief description of the activity, and the location where it shares 
sequence similarity to Applicants' polypeptide sequence. 

Using this analysis, biochemical activity of the known protein is associated with 
Applicants' proteins. An example for the polypeptide described above is as follows: 
(Dp) Rel. AA Sequence 

- Align. NO 524716 

- gi No 2502079 

- Desp. : (AF022391) immediate early protein; ICP0 [Feline herpesvirus 1] 

- % Idnt. : 33.7 
-Align. Len.: 87 

- Loc. Sequence ID NO 133059: 52 -> 137 aa. 



Differential expression results show when the coding sequence is transcribed, and 
therefore when the activity of the protein is deployed by the cell. Similar coding sequences can 
have very different physiological consequences because the sequences are expressed at different 
times or places, rather than because of any differences in protein activity. Therefore, modified 
levels (increased or decreased) of expression as compared to a control provide an indication of 
the function of a corresponding gene, gene components, and gene products. 

These experiments can determine which are genes "over-expressed" under a given 
stimulus. Such over-expressed genes give rise to higher transcript levels in a plant or cell that is 
stimulated as compared to the transcript levels of the same genes in a control organism or cell. 
Similarly, differential expression experiments can reveal "under-expressed" genes. 

To increase the cellular response to a stimulus, additional copies of the coding sequences 
of a gene that is over-expressed are inserted into a cell. Increasing transcript levels of an over- 
expressed gene can either heighten or prolong the particular cellular response. A similar 



A. 1 .b. DIFFERENTIAL EXPRESSION RESULTS EXPLAIN IN 



WHICH CELLULAR RESPONSES THE PROTEINS OF THE 



INVENTION ARE INVOLVED 
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enhancement can occur when transcription of an under-expressed gene is inhibited. In contrast, 
the cellular response will be shortened or less severe when the over-expressed genes are 
inhibited or when expression of the under-expressed genes are increased. 

In addition to analyzing the levels of transcription, the data were also analyzed to gain 
insight into the changes in transcription over time. That is, while the plants in the experiments 
were reacting to either an external or internal stimulus, a differential experiment takes a snapshot 
of the transcription levels in the cells at one specific time. However, a number of snap-shots can 
be taken at different time points during an external stimulus regime, or at different stages of 
development during an internal stimulus. These results show how the plant changes 
transcription levels over time, and therefore protein levels in response to specific stimuli to 
produce phenotypic changes. These results show that a protein can be implicated in a single, but 
more likely, in a number of cellular responses. 



Applicants produced data from plants at different times after a specific stimulus. These 
results show whether the expression level of a gene spikes at a key moment during the cellular 
response, or whether the transcript level remains constant. Thus, coding sequences not only can 
be determined to be over- or under-expressed, but also can be classified by the initial timing and 
duration of differential expression. This understanding of timing can be used to increase or 
decrease any desired cellular response. 

Generally, Applicants have assayed plants at 2 to 4 different time points after exposing 
the plants to the desired stimuli. From these experiments, "early" and "late" responders were 
identified. These labels are applied to either the regulatory sequences driving transcription of 
the gene as well as to the protein encoded by the gene. 

The following example illustrates how the genes, gene components and products were 
classified as either early or late responders following a specific. The mRNAs from plants 
exposed to drought conditions were isolated 1 hour and 6 hours after exposure to drought 
conditions. These mRNAs were tested utilizing microarray techniques. The graph below 
illuminates possible transcription profiles over the time course, plotting all the (+) data points as 
+1 and all the (-) data points as -1 : 



1. THE TRANSCRIPT LEVELS OF A PROTEIN OVER 



TIME IN RESPONSE TO A STIMULI ARE REVEALED BY 



TRANSCRIPTIONAL ANALYSES OVER MANY 



EXPERIMENTS 
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cDNA - early responder type 1 
cDNA - early responder type 2 
cDNA - late responder type 1 
X — cDNA - late responder type 2 




(The value for each time point was determined using a pair of microarray chips as 
described above.) 



Data acquired from this type of time course experiment are useful to understand how one 
may increase or decrease the speed of the cellular response. Inserting into a cell extra copies of 
the coding sequence of early responders in order to over-express the specific gene can trigger a 
faster cellular response. Alternatively, coding sequences of late responders that are over- 
expressed can be placed under the control of promoters of early responders as another means to 
increase the cellular response. 

Inserting anti-sense or sense mRNA suppression constructs of the early responders that 
are over-expressed can retard action of the late responders, thereby delaying the desired cellular 
response. In another embodiment, extra copies of the promoters of both early and late 
responders can be added to inhibit expression of both types of over-expressed genes. 

The experiments described herein can be grouped together to determine the time course 
of the transcript levels of different coding sequences in response to different stimuli. Examples 
of different groups are as follows (the examples include the IDs for both corn and Arabidopsis 
experiments): 

. NAA (EXPT IDs 108564, 108565, 108516, 108554) 
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BA(EXPT IDs 108566, 108567, 108517) 

GA(EXPTIDs 108562, 108563, 108519, 108520, 108521, 108484, 108485, 
108486) 

BR(EXPTIDs 108580, 108581, 108557, 108478, 108479, 108480, 108481) 
ABA(EXPT IDs 108560, 108561, 108513, 108597) 

Drought (EXPT IDs 108572, 108573, 108502, 108503, 108504, 108556, 108482, 

108483, 108473, 108474, 108477) 

Cold (EXPT IDs 108578, 108579, 108533, 108534) 

Heat (EXPT IDs 108576, 108577, 108522, 108523) 

Osmotic stress (EXPT IDsl08570, 108571, 108541, 108542, 108553, 108539, 
108540) 

Reactive Oxygen (EXPT IDs 108582, 108583, 108537, 108538, 108558) 
NO (EXPT IDs 108584, 108585, 108526, 108527, 108559) 
Wounding (EXPT IDs 108574, 108575, 108524, 108525) 

SA (EXPT IDs 108586, 108587, 108515, 108552, 108471, 108472, 108469, 108470, 

107953, 107960, 108443, 108440, 108441, 108475, 108476) 

MeJA (EXPT IDs 108568, 108569) 

Finale (EXPT IDs 108467, 107871, 107876) 

Trimec (EXPT IDs 108466, 107886, 107891) 

Round-up (EXPT IDs 108465, 107896) 

Glean (EXPT IDs 108468, 107881) 



2. THE TRANSCRIPT LEVELS OF A PROTEIN OVER 
DIFFERENT DEVELOPMENTAL STAGES CAN BE 
IDENTIFIED BY TRANSCRIPTIONAL ANALYSES OVER 
MANY EXPERIMENTS 
Differential expression data were produced for different development stages of various 
organs and tissues. Measurement of transcript levels can divulge whether specific genes give 
rise to spikes of transcription at specific times during development, or whether transcription 
levels remain constant. This understanding can be used to increase speed of development, or to 
arrest development at a specific stage. 

Like the time-course experiments, the developmental stage data can classify genes as 
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being transcribed at early or late stages of development. Generally, Applicants assayed different 
organs or tissues at 2-4 different stages. 

Inhibiting under-expressed genes at either early or late stages can trigger faster 
development times. The overall development time also can be increased by this means to allow 
organs and tissue to grow to a larger size or to allow more organs or tissues to be produced. 
Alternatively, coding sequences of late stage genes that are under-expressed can be placed under 
the control of promoters of early stage genes to increase heighten development. 

Inserting extra copies of the coding sequence early stage genes that are under-expressed 
can retard action of the late-stage genes and delay the desired development. 

Fruit development of Arabidopsis is one example that can be studied. Siliques of 
varying sizes, which are representative of different stages, were assayed by microarray 
techniques. Specifically, mRNA was isolated from siliques between 0-5mm, between 5- 10mm 
and > 10 mm in length. The graph below shows expression pattern of a cell wall synthesis gene, 
cDNAID 1595707, during fruit development: 

The developmental course shows that the gene encoding a cell wall synthesis protein is 
up-regulated when the fruit is 0-5mm but returns to normal levels at 5- 10mm and > 10mm. 
Increase of cell wall synthesis can lead to larger cells and/or greater number of cells. This type 
of increase can boost fruit yield. The coding sequence of the cell wall synthesis protein under 
the control of a strong early stage promoter would increase fruit size or number. 



+ 




cDNAJD 1595707 

Cell Wall Synthesis 
Gene 



0-5 mm 



5-10 mm 



> 10 mm 



A pectinesterase gene was also differentially expressed during fruit development, cDNA 
ID 1396123. Pectinesterase catalyzes the hydrolysis of pectin into pectate and methanol. This 
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biochemical activity plays an important role in cell wall metabolism during fruit ripening. To 
shorten the time for fruit ripening, extra copies of this gene with its endogenous promoter can be 
inserted into a desired plant. With its native promoter, the extra copies of the gene will be 
expressed at the normal time, to promote extra pectinesterase at the optimal stage of fruit 
5 development thereby shortening ripening time. 

A number of Applicant's experiments can be grouped together to study changes of 
transcript levels over a number development stages. Below are examples of groups of 
experiments: 

10 • Root, Root Tip, and rhl mutant (EXPT IDs 108594, 108433, 108599, 108434, 

108439) 

. Flowers Drought Exposed Flowers, SA Treated Flowers (EXPT IDs 108473, 
108474, 108429, 108430, 108431, 108475, 108476, 108501) 
O • BR Shoot Apices, Leaves, Stm (EXPT IDs 108478, 108479, 108480, 108481, 

js 108598, 108535, 108536, 108435) 

f . Leaf and Stm (EXPT IDs 108477, 108512, 108497, 108498, 108598108478, 

fa* 

FU 108479, 108480, 108481, 108598, 108535, 108536, 108435) 

J . Imbibded & Germinating Seeds 1, 2, 3, And 4 Days (EXPT IDs 108461, 108462, 

L, 108463, 108464, 108528, 108529, 108530, 108531, 108545, 108546, 108547, 

lo 108518, 108529, 108543, 108544) 

^ • Tissue Specific Expression (3 week rosette leaves, Tissue Specific Expression (3 

week stems), Tissue Specific Expression (2 week roots) (EXPT IDs 1 08497, 1 08498, 
108439) 

• Tissue Specific Expression (3 week rosette leaves), Germinating Seeds (EXPT IDs 
25 108497, 108461) 

• Tissue Specific Expression (3 week rosette leaves, stm mutants, BRJShoot Apices 
Expt, root tips, Tissue Specific Expression (2 week roots) (EXPT IDs 108497, 
108435, 108480, 108434, 108439) 

• BRJShoot Apices Expt, root tips, Tissue Specific Expression (flower buds) (EXPT 
30 IDs 108480, 108434, 108431) 

• Arab Ler-pi ovule l, ap2-floral buds, Tissue Specific Expression (flower buds), 
Tissue Specific Expression (<5 mm siliques) (EXPT IDs 108595, 108501, 108431, 
108437) 
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Tissue Specific Expression (2 week roots), rhl mutant2, BR Shoot Apices Expt, 
Trichome Inflorescences (EXPT IDs 108439, 108433, 108480, 108452) 




3. PROTEINS THAT ARE COMMON IN A NUMBER OF 
SIMILAR RESPONSES CAN BE IDENTIFIED BY 
10 TRANSCRIPTIONAL ANALYSES OVER A NUMBER OF 

EXPERIMENTS 

The differential expression experiments also reveal the genes, and therefore the coding 
sequence, that are common to a number of cellular responses. By identifying the genes that are 
D differentially expressed in a number of similar responses, the genes at the nexus of a range of 

Ji5 responses are discovered. For example, genes that are differentially expressed in all the stress 
%l responses are at the hub of many of the stress response pathways. 

ry These types of nexus genes, proteins, and pathways are differentially expressed in many 

In 

g-j or majority of the responses or developmental conditions of interest. Typically, a nexus gene, 
!L protein, or pathway is differentially expressed in generally the same direction in many or 
CgO majority of all the desired experiments. By doing so, the nexus gene can be responsible for 
'% triggering the same or similar set of pathways or networks for various cellular responses. This 
p type of gene is useful in modulating pleiotropic effects or triggering or inhibiting a general class 
of responses. 

When nexus genes are differentially expressed in a set of responses, but in different 
25 directions, these data indicate that a nexus gene is responsible for creating the specificity in a 
response by triggering the same pathway but to a different degree. Placing such nexus genes 
under a constitutive promoter to express the proteins at a more constant level can remove the 
fluctuations. For example, a plant that is better drought adapted, but not cold adapted can be 
modified to be tolerant to both conditions by placing under the control of a constitutive promoter 
30 a nexus gene that is up-regulated in drought but down regulated in cold. 

Applicants' experiments can be grouped together to identify such nexus genes. 
Examples of these groups are as follows: 

• Herbicide Response 
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Trimec, Finale, Glean, Round-up (EXPT IDs 108467, 107871, 107876, 108468, 
107881, 108465, 107896, 108466, 107886, 107891) 
• Stress Response 

. Drought, Cold, Heat, Osmotic Stress (EXPT IDs 108578, 108579, 108533, 
5 108534, 108572, 108573, 108502, 108503, 108504, 108556, 108482, 108483, 

108473, 108474, 108477, 108576, 108577, 108522, 108523, 108570, 108571, 
108541, 108542, 108553, 108539, 108540) 

. Drought, Cold, Heat, PEG, Trimec, Finale, Glean, Round-up (EXPT IDs 
10 108578, 108579, 108533, 108534, 108572, 108573, 108502, 108503, 108504, 

108556, 108482, 108483, 108473, 108474, 108477, 108576, 108577, 108522, 
108523, 108570, 108571, 108541, 108542, 108553, 108539, 108540) 
. Wounding, SA, MeJA, Reactive Oxygen, NO (EXPT IDs 1 08568, 108569, 
C 108555, 108584, 108585, 108526, 108527, 108559, 108582, 108583, 108537, 

iS : 

JjS 108538, 108558, 108586, 108587, 108515, 108552, 108471, 108472, 108469, 

f 108470, 107953, 107960, 108443, 108440, 108441, 108475, 108476, 108574, 

FU 108575, 108524, 108525) 

rp • Hormone Responses 

I, . NAA, BA, BR, GA, TRIMEC (EXPT IDs 108566, 108567, 108517, 108580, 

fflO 108581, 108557, 108478, 108479, 108480, 108481, 108562, 108563, 108519, 

% 108520, 108521, 108484, 108485, 108486, 108564, 108565, 108516, 108554, 

O 108466, 107886, 107891) 

. NAA, Trimec (EXPT IDs 108566, 108567, 108517, 108580, 108581, 108557, 

108478, 108479, 108480, 108481, 108562, 108563, 108519, 108520, 108521, 
25 108484, 108485, 108486, 108564, 108565, 108516, 108554, 108466, 107886, 

107891) 

4. PROTEINS THAT ARE COMMON TO DISPARATE 
RESPONSES CAN BE IDENTIFIED BY 
30 TRANSCRIPTIONAL ANALYSES OVER A NUMBER 

OF EXPERIMENTS 

Phenotypes and traits result from complex interactions between cellular pathways and 
networks. Which pathways are linked by expression of common genes to specify particular 
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traits can be discerned by identifying the genes that show differential expression of seemingly 
disparate responses or developmental stages. For example, hormone fluxes in a plant can direct 
cell patterning and organ development. Genes that are differentially expressed both in the 
hormone experiments and organ development experiments would be of particular interest to 
control plant development. 

Examples Of Such Pathway Interactions Include: 

(i) The Interaction Between Stress Tolerance Pathways And Metabolism Pathways; 

(ii) Interaction Between Hormone Responses And Developmental Changes In The 
Plant; 

(iii) Interactions Between Nutrient Uptake And Developmental Changes; 

(iv) Mediation Of Stress Response By Hormone Responses; And 

(v) Interactions Between Stress Response And Development. 
Applicant's experiments can be grouped together to identify proteins that participate in 
interacting pathways or networks. Specific groups of experiments include, for example: 



(i) Stress & Metabolism 

• Germinating Seeds (Day 1), Arab_0. luM_Epi-Brass_l , 

Arab_N03_H-to-L_l, Arab_100uM_GA3_l (EXPT IDs 108461, 
108580, 108592, 108562) 

(ii) Hormones & Development 

. NAA, BA & Root Tips (EXPT IDs 108566, 108567, 108517, 

108564, 108565, 108516, 108554, 108434, 108466, 107886, 107891) 
. NAA, Roots & Root Tips (EXPT IDs 108564, 108565, 1085 16, 

108554, 108599, 108434, 108439, 108466, 107886, 107891) 
. NAA, BA, Roots And/Or Root Tips (EXPT IDs 108564, 108565, 

108516, 108554, 108599, 108434, 108439, 108466, 107886, 107891, 

108566, 108567, 108517) 
. NAA, B A And Leaf (EXPT IDs 1 08566, 1 08567, 1 085 1 7, 1 085 1 8, 

108529, 108512, 108497, 108498, 108598, 108564, 108565, 108516, 

108554, 108466, 107886, 107891) 
. NAA, BA, Leaves, Roots And/Or Root Tips (EXPT IDs 108566, 

108567, 108517, 108518, 108529, 108512, 108497, 108498, 108598, 

108564, 108565, 108516, 108554, 108466, 107886, 107891, 108599, 

108434, 108439) 
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. ABA & Siliques (Of Any Size) (EXPT IDs 108560, 108561, 108513, 

108597, 108436, 108437, 108438) 
. GA, Imbibed & Germinating Seeds, ABA & Siliques (Of Any Size) 
(EXPT IDs 108560, 108561, 108513, 108597, 108562, 108563, 
108519, 108520, 108521, 108484, 108485, 108486, 108461, 108462, 
108463, 108464, 108528, 108529, 108530, 108531, 108545, 108546, 
108547, 108518, 108529, 108543, 108544, 108436, 108437, 108438) 

• Tissue Specific Expression (3 week rosette leaves), 
ArabO. luM_Epi-Brass_l , Arab_100uM_GA3_l, Germinating 
Seeds (Day 1), (EXPT IDs 108461, 108497, 108580, 108562, 
108461) 

Nutrient Uptake And Development 

. Any Or All Nitrogen Experiments With Siliques (Of Any Size) 
(EXPT IDs 108592, 108593, 108588, 108589, 108590, 108591, 
108532, 108548, 108549, 108550, 108551, 108454, 108455, 108487, 
108488, 108489, 108436, 108437, 108438) 

. Any Or All Nitrogen Experiments With Roots Or Root Tips (EXPT 
IDs 108518, 108529, 108592, 108593, 108588, 108589, 108590, 
108591, 108532, 108548, 108549, 108550, 108551, 108454, 108455, 
108487, 108488, 108489, 108594, 108433, 108599, 108434, 108439) 

Stress & Hormones 

. ABA, Drought (EXPT IDs 108560, 108561, 108513, 108597, 

108572. 108573. 108502, 108503, 108504, 108556, 108482, 108483, 
108473, 108474, 108477) 

. ABA, Drought, Cold, Heat, & Wounding (EXPT IDs 108560, 

108561, 108513, 108597, 108578, 108579, 108533, 108534, 108572, 

108573. 108502. 108503, 108504, 108556, 108482, 108483, 108473, 
108474, 108477, 108576, 108577, 108522, 108523, 108574, 108575, 
108524, 108525) 

• Tissue Specific Expression (3 week rosette leaves), 
Arab_100uM_ABA_l, Ws Arabidopsis Drought 2 days, Ws 
Arabidopsis Drought 4 days (EXPT IDs 108497, 108560, 108477, 
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108482) 



(v) 



Stress & Hormones Stress & Hormones 



Nitrogen High transition to Low, Arab_N03_H-to-L_l, Tissue 
Specific Expression (<5mm siliques), Tissue Specific Expression (5- 
10mm siliques) (EXPT IDs 108455, 108592, 108437, 108436) 



A. 1 .c. OBSERVATIONS OF PHENOTYPIC CHANGES SHOW 



WHAT PHYSIOLOGICAL CONSEQUENCES APPLICANTS' 
PROTEINS CAN PRODUCE 



Another direct means of determining the physiological consequences of a protein is to 
make aberrant decreases or increases of its expression level in a cell. To this end, Applicants 
have produced plants where specific genes have been disrupted, or produced plants that include 
an extra expressed copy of the gene. The plants were then planted under various conditions to 
determine if any visible physiological changes are caused. These changes then are attributed to 
the changes in protein levels. 

LB. SPECIFICITY OF REGULATORY SEQUENCE ARE UNCOVERED BY 
THE CERES GENOMIC ENGINE 

I.B.l. DIFFERENTIAL EXPRESSION RESULTS EXPLAIN WHICH 
EXTERNAL OR INTERNAL STIMULI TRIGGER THE 
REGULATORY SEQUENCES 
Transcriptional studies can reveal the time and place that genes are expressed. 
Typically, regulatory sequences, such as promoters, introns, UTRs, etc., control when and in 
which cells transcription occurs. Differential studies can explain the temporal- and location- 
specific regulatory sequences that control transcription. 

Using the experiments that are provided herein, one skilled in the art can choose a 
promoter or any other regulatory sequence that is capable of facilitating the desired pattern of 
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transcription. For example, if a promoter is needed to give rise to increased levels of 
transcription in response to auxin, but little expression in response to cytokinin, then the 
promoters of cDNAs that were up-regulated in the auxin experiments, but down-regulated the 
cytokinin experiments would be of interest. 



Evaluation of time-course data as described above is also useful to identify time-specific 
promoters. Promoters or regulatory sequences, like the coding sequences, can be classified as 
early or late responding according to the microarray data. Promoters that facilitate expression of 
early or late genes are useful to direct expression of heterologous coding sequences to modulate 
the cellular response. In the drought data, promoters from "early" responding genes can be 
selected to activate expression of any desired coding sequence. Thus, a coding sequence for a 
salt-tolerance protein that is not typically expressed early in response to drought could be linked 
to an "early" responding promoter to increase salt tolerance within one hour after exposure to 
drought conditions. 



Another class of time-sensitive promoters and other regulatory sequence can be 
identified from the experiments examining different developmental stages. These regulatory 
sequences can drive transcription of heterologous sequence at particular times during 
development. For example, expression of stress-responsive genes during fruit development can 
protect any gain in fruit yield. 

Common To Many Pathways - Cause General Effects 
Promoters and other regulatory sequence associated with cDNAs that are differentially 
expressed in a number of similar responses can be used to cause general effects. These types of 
regulatory sequences can be used to inhibit or increase expression of a desired coding sequence 
in a number circumstances. For example, protein that is capable of acting as an insecticide can 
be placed under the control a general "stress" promoter to increase expression, not only when the 
plant is wounded, but under other stress attack. 



H. EXPERIMENTAL RESULTS ALSO REVEAL PATHWAYS OR NETWORKS OF 



Time Course Experiments - Time Sensitive 



Developmental Experiments - Time Sensitive 



GENES 
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II A. GENES WHOSE TRANSCRIPTION ARE WELL COORDINATED 
GENERALLY ACT TOGETHER TO PRODUCE PROTEINS THAT 
PARTICIPATE IN THE SAME PATHWAY OR NETWORK 
5 Patrick Brown, one of the pioneers of microarray chip technology, demonstrated that 

differential expression experiments can identify groups of genes that encode proteins that 
participate the same pathway or network. The work focused on phosphate accumulation and 
metabolism genes in yeast and was published in the paper Ogawa et al. 9 Mol Biol Cell (2000) 
Dec;l 1(12):4309-21. The authors identified by microarray analysis 22 genes whose transcription 
1 0 was regulated by phosphate concentration. Promoter analysis of these genes showed that 21 of 
them contained a sequence in their promoters that is recognized by a transcriptional activator 
that is regulated by phosphate. Further, phenotypic studies were completed by mutational 
analysis of many of these 22 genes in yeast. The mutants were shown to be either severely 
O deficient in accumulation of inorganic polyphosphate (polyP) and P(i), or associated with 
J5 normal catabolism of polyP in the yeast vacuole. This publication proves that genes with 
£ correlated transcriptional profiles do indeed participate in the same pathway or network. 
\ y 

m 

' n II.A.1. CALCULATING THE CORRELATION COEFFICIENT BETWEEN 

§0 PAIRS OF GENES BASED ON THE DIFFERENTIAL 

T EXPRESSION DATA 

^ The differential expression data obtained over many experiments reveal the global 

pattern of transcription of a gene. Transcription patterns, also referred to as profiles, of two 
different genes can be compared. From this comparison, a correlation coefficient can be 
25 calculated as a measure of the strength of the relationship between the two profiles. 

Transcription profiles can be compared by plotting as a point, the differential expression 
of genel on the x-axis and gene 2 on the y-axis on one experiment. If all the pairs lie on a 
regression line the relationship and correlation between the two genes are strong. The 
correlation coefficient can be calculated using a number of methods. In the present case, the 
30 Spearman method was utilized. 

The correlation coefficient can vary from -1 to 1. The coefficient indicates the 
strength of the relationship between two mRNA transcripts of any set of data that is 
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examined. A zero coefficient indicates that no correlation exists between the transcription 
profiles of two genes in the samples examined. 

Biologically, a high correlation coefficient indicates that a gene(s) triggers the 
activation or repression of the correlated genes, or have related functional roles. Thus, 
5 illumination of the activity of one gene can indicate the activities of the genes with highly 
correlated transcription profiles. This implication is true whether the activity is a biochemical 
activity, molecular interaction, cellular response, or physiological consequence. 

II.A.2. THE COMPLETE LINKAGE ANALYSES OF DIFFERENTIAL 
10 IDENTITY GENES WITH SIMILAR PATTERN OF 

TRANSCRIPTION 

The complete linkage analysis can build groups (or "clusters") of genes whose 
transcription patterns are highly correlated or co-regulated. 
O Because genes with related functions are frequently expressed in similar patterns, 

Jh5 utilities or roles can be ascribed for genes (without observation of transformed plants) based 
% on their temporal association with other genes of known function (a "guilt-by-association M 
rir| analysis). Ogawa et al. has used correlated mRNA transcription profiles to identify the 
rp function of proteins of unknown function. 

;L The complete linkage analysis utilizes the correlation coefficients that are calculated 

C20 , for each pair of genes tested in the microarray experiments. A cluster is first seeded with any 
jI arbitrary transcript tested on the chip. The seed transcript, for this illustration, is designated 
^ mRNA#0. Next, a minimum threshold is chosen for all acceptable correlation coefficients. 
In this case, the threshold used was 0.75. A list of potential cluster members is compiled by 
choosing mRNA transcripts that have a correlation coefficient with mRNA#0 that is greater 
25 than the threshold. No limit is placed on the number of mRNAs that can be added to a cluster 
so long as the correlation coefficient meets the threshold limit criterion. 

For this example, assume that four mRNAs were added to the cluster, mRNA_l to 
mRNA_4. Once the potential cluster members are identified, the cDNA IDs of each member 
is added to the potential list in order its correlation coefficient to mRNA#l, the largest 
30 correlation coefficient first. For this example, let's suppose four mRNAs 1-4 are potential 
members, they would be ordered as follows: 



MRNA# 



Correlation Coefficient 
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with mRNA#0 


MRNA#1 


0.9 


MRNA#2 


0.8 


MRNA#3 


0.78 


MRNA#4 


0.75 



A potential member is accepted into the group, if its correlation coefficients with all 
other potential members are all greater than the threshold. Thus, for mRNA#l to remain in 
the group the correlation coefficient between mRNA#l and mRNA#2 must be greater than 
0.75; and mRNA#l and #3 > 0.75; and mRNA#l and mRNA#4 > 0.75. Potential cluster 
members are removed only after reviewing the correlation coefficients in a specific order 
where mRNAs are reviewed in the order that they appear on the list. 

Consequently, review of the correlation coefficients does not begin with any random 
pair, such as mRNA#3 and mRNA#4. The review begins between mRNA#l and mRNA#2, 
which are the top two on the list. 

If correlation coefficient between mRNA#l and mRNA#2 is less than the threshold, 
then mRNA#2 is removed from the cluster. mRNA#2 is removed because its correlation 
coefficient with mRNA#0 is 0.8 which is less than 0.9, the correlation coefficient of 
mRNA#l andmRNA#0. 

This illustrates the rule that if the correlation coefficient is less than the threshold, then 
only one of the pair not accepted as a cluster member, specifically, the one with the lower 
coefficient to the seed mRNA#0. 

This process of iterative reviewing of correlation coefficients between potential members 
continues until all pairs are reviewed. In this case, the coefficient between mRNA#l and 
mRNA#3 would be reviewed because these are the two highest ones on the list besides 
mRNA#l and #2. The next pair to be reviewed would be mRNA#l and #4, etc. 

Applicants have analyzed the data using several sets of parameters for the 
complete linkage analysis as shown in the table below: 



Method 


Correlation 
Coefficient 
Threshold 


Max number of 
members in a cluster 


Organism 


CL METHOD TYPE= 
TRUE 


0.9 


MAX_SIZE=15 


Arabidopsis 


CL METHOD_TYPE= 


0.75 


MAX_SIZE=30000 


Arabidopsis 
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TRUE 








CL METHOD TYPE= 
TRUE 


0.70 


MAX_SIZE=30000 


Arabidopsis 


CL METHOD TYPE= 
TRUE 


0.9 


MAX_SIZE=15 


Zea 


CL METHOD TYPE= 
TRUE 


0.75 


MAX_SIZE=30000 


Zea 


CL METHOD TYPE- 
TRUE 


0.70 


MAX_SIZE=30000 


Zea 


CL METHOD TYPE= 
TRUE 


0.9 


MAX_SIZE=15 


Arabidopsis 


CL METHOD TYPE= 
TRUE 


0.75 


MAX_SIZE=30000 


Arabidopsis 


CL METHOD TYPE- 
TRUE 


0.70 


MAX_SIZE=30000 


Arabidopsis 


CL METHOD TYPE= 
TRUE 


0.9 


MAX_SIZE=15 


Zea 


CL METHOD TYPE= 
TRUE 


0.75 


MAX_SIZE=30000 


Zea 


CL METHOD TYPE= 
TRUE 


0.70 


MAX_SIZE=30000 


Zea 



The results of these cluster analyses are reported in the MA_clust table. 



n.A.3. THE NEAREST NEIGHBOR ANALYSES OF DIFFERENTIAL 
GROUP GENES WITH CORRELATED BUT DISSIMILAR 
TRANSCRIPTION PROFILES 

The nearest neighbor analysis differs from the complete linkage algorithm by not 
requiring all members to meet the correlation threshold with each other. Thus, a member of a 
nearest neighbor cluster need only be closely correlated to one other member of the cluster. 
It is not even required that all members be closely correlated to the seed mRNA transcript. 

In a complete linkage cluster all the transcription profile of all members are correlated 
to a greater or lesser extent. In contrast, a cluster deduced by the nearest neighbor analysis 
may include members with differing transcription profiles. However, nearest neighbor brings 
to light clusters of interacting genes. In the nearest neighbor analysis, the seed mRNA may 
not have a very high correlation coefficient with the last mRNA added to the cluster. 

The nearest neighbor analysis, like the complete linkage analysis, is initiated by 
seeding each cluster with a mRNA_0. The cluster size is determined by setting a threshold 
coefficient and setting a limit on the number of members that can be added to the cluster. 
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The cluster is^xpanded in an iterative fashion determining which mRNA has the 
highest correlation coefficient with mRNA_0. The additional member is labeled mRNA_l. 
Next, a list of potential candidates is generated by finding the mRNA that has the highest 
correlation to mRNA_0 (besides mRNA_l) and finding the mRNA that has the highest 
coefficient with mRNA_l. Whichever of the candidates has the highest correlation 
coefficient is added to the cluster. Then, a list of three potential candidates is generated 
similarly. 

Addition of members continues until either (1) all the correlation coefficients of 
potential members is lower than the threshold or (2) number of members in the cluster meets 
the size limitation. 



Applicants have analyzed the data using several sets of parameters for the nearest 



neighbor analysis as shown in the table below: 



Method 


Correlation 
Coefficient 
Threshold 


Max number of 
members in a 
cluster 


Organism 


NN METHOD TYPE=TRUE 


0.5 


MAX_HITS=15 


Arabidopsis 


FULL NN METHODTYPE 

=TRUE 


0.8 


NONE 


Arabidopsis 


FULL NN METHOD TYPE 
=TRUE 


0.6 


NONE 


Arabidopsis 


NN METHOD TYPE=TRUE 


0.5 


MAX HITS=15 


Zea 


FULL NN METHOD TYPE 

=TRUE 


0.8 


NONE 


Zea 


FULL NN METHOD TYPE 
=TRUE 


0.6 


NONE 


Zea 


NN METHOD TYPE=TRUE 


0.5 


MAXfflTS=15 


Arabidopsis 


FULL NN METHOD TYPE 

=TRUE 


0.8 


NONE 


Arabidopsis 


FULL NN METHOD TYPE 
=TRUE 


0.6 


NONE 


Arabidopsis 


NN METHOD_TYPE=TRUE 


0.5 


MAX HITS=15 


Zea 


FULL NN METHOD TYPE 

=TRUE 


0.8 


NONE 


Zea 


FULL NN METHOD TYPE 


0.6 


NONE 


Zea 
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The results of these cluster analyses are reported in the MA_clust table. 

m. EXPERIMENTAL RESULTS REVEAL THE FUNCTIONS AND 

CHARACTERISTICS OF GENES, PATHWAYS AND NETWORKS 

m. A. LINKING BIOCHEMICAL OR METABOLIC ACTIVITIES OF ONE 

PROTEIN IN A CLUSTER TO THE OTHER PROTEINS IN THE SAME 
MICROARRAY CLUSTER 
As shown in the Ogawa et a/., Mol Biol Cell (2000), genes whose transcription 
profiles cluster together as being strongly correlated typically take part in the same pathway 
or network. Thus, the activity of one gene in the cluster can be associated to the other genes 
in the cluster with highly correlated transcription profiles. This association is true whether 
the activity is a biochemical activity, molecular interaction, cellular response or physiological 
consequence. 

One example of this is cluster 420 of the report (shown below). In this cluster, a protein 
encoded by cDNA ID 1025791 did not match to any pFAM domain. However, through the 
microarray data, the gene that encodes that protein had a transcription profile that was correlated 
with other genes that encode ribosomal proteins. Thus, the activity of the ribosomal genes can 
be associated with the protein with no pFAM match. All the proteins in the same cluster would 
be associated with mRNA translation and protein synthesis. 
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420 


1025791 

I 


803433 


4585878 


(AC005850} 
Unknown 
protein 
[Arabidopsis 




420 


4608965 

I 


671877 


8567795 


(AC013428) 

ribosomal 
protein S17, 
pu 


Ribosomal_ 
S17e 


Ribosomal 
S17 


420 


5663116 

I 


818554 


7486478 


hypothetical 
protein 
F6E13.17- 
Arabidop 


DapB 


Dihydrodipic 

olinate 

reductase 



m.B. USING DIFFERENTIAL EXPRESSION DATA TO DETERMINE WHEN 
THE GENES AND PATHWAYS ARE ACTIVE 

The differential expression data can be used to associate the cellular response that results 
when the clusters of genes are transcribed. For the complete linkage clusters, the genes in the 
cluster will produce similar transcription profiles. The experiments where the genes in the 
cluster are differentially expressed as compared to the control define the cellular responses that 
all the genes of the cluster are capable of modulating. 

For example, for the cluster shown above, the mRNA levels for the genes were 
significantly different in the nitrogen response experiments. Thus, the data shows that this 
cluster of genes is associated with protein synthesis in response to nutrient uptake. 



m.C USING PHENOTYPE DATA TO DETERMINE WHEN GENES AND 
PATHWAYS ARE ACTIVE 

The phenotypic data can be used to demonstrate the physiological consequences of that 
result when a cluster of genes is active. Whether the clusters were generated by the complete 
linkage or the nearest neighbor analyses, if a single gene in the cluster has been implicated in 
phenotypic changes, then any one or combination of the other genes in the cluster can also 
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modulate the same or similar phenotypic changes. 
Utilities of Particular Interest 

The following sections describes utilities/functions for the genes, gene components and 
5 products of the invention. The sequences of the invention, as discussed above, can be recognized 
as a particular type of gene (e.g. root gene, leaf gene, etc.) by means of particular terms utilized 
in the Knock-in and Knock-out Tables and by the results of the differential expression 
experiments. Combined analysis of those data also identify genes with utihties/functions of 
particular interest. The Single Gene Functions and Utilities Table correlates that data and 
1 0 specific genes with those utihties/functions of particular interest. 

Utilities of Particular Interest for Clustered Sequences 

As discussed further herein, the genes, gene components and products of the invention 
C have been clustered together into groups. This enables one to understand the function/utility of 
J;5 one member of the cluster based upon knowledge about one or more other members of the 
2; cluster. In addition, this enables an understanding of some utihties/functions of a cluster that 
fU would be of particular interest. The Cluster Fucntions and Utilities Table lists some of the 

m 

g s i clusters of the invention and notes the functions/utilities that are of particular interest for each of 

L. the clusters. Of course, these functions/utilities are of particular interest for each member of each 

C20 particular cluster. 

ru 

O IV. EXPERIMENTAL RESULTS PROVIDE AN U NDERSTANDING OF GENES, 

La 1 

PATHWAYS AND NETWORKS IN MANY PLANT SPECIES 

25 IV.A. INTRO: USING PROTEIN Sequence SIMILARITY TO ASSIGN 

FUNCTION FROM ARABIDOPSIS PROTEINS TO PROTEINS OF 
OTHER SPECIES 

By analyzing the constant and variable properties of groups of similar sequences, it is 
possible to derive a structural and functional signature for a protein family, which distinguishes 
30 its members from all other proteins. This approach has allowed the Applicants to assign proteins 
into functional groups and identify orthologous proteins both within and between species. A 
pertinent analogy to be considered is the use of fingerprints by the police for identification 
purposes. A fingerprint is generally sufficient to identify a given individual. Similarly, a protein 
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signature can be used to assign a newly sequenced protein to a specific family of proteins and 
thus to formulate hypotheses about its function. 

Proteins can be grouped together because they share a single motif or many motifs. 
Typically, proteins that share a series of motifs share greater functional equivalence. Usually, 
signature sequences comprise more than one motif in a particular order from N-terminus to C- 
terminus. 

A list of these groups can be found in the Protein Group Table. The sequences were 
grouped together using the iterative protein sequence local alignment software, PSI-BLAST. 
This software begins by aligning a number sequences where the probability that the alignment 
occurred by chance is set by a threshold e-value. In the Applicants' case, the threshold e-value 
was set at 10~ 50 , 10" 30 , and 10~ 10 . The algorithm generates a consensus sequence from the 
sequences that were aligned together. The consensus sequence was then used to find sequences 
that matched to it with a probability that was less than the set threshold. The algorithm performs 
the iterative tasks of aligning and generating a consensus sequence any number of times. 
Generally, Applicants performed one iteration for the 10* 10 e-value threshold, two iterations for 
the 10" 30 threshold, and three iterations for the 10" 50 threshold. 

Each group can contain sequences from one of more organisms. The groups included 
both Ceres polypeptides and public polypeptide sequences. The Ceres polypeptides are 
identified by their Ceres Sequence ID NO as listed in the Reference Table. 

Each group contains sequences that were included at the 10~ 50 , 10~ 30 , and 10" 10 e-value 
cutoffs. For each group, the peptide ID and at which cutoff the peptide was included into the 
group. The same peptide ID may be included in the group three times as peptide ID 50, peptide 
ED 30 and peptide ID 10. The data indicates that peptide ID was included in the group when the 
threshold was either 10' 50 , 10" 30 , or 10" 10 . All the peptide IDs that are followed by "50" were 
included in the protein group when the e-value cutoff was 10" 50 . All the peptide IDs that are 
followed by either "30" or "50" were included in the protein group when the threshold e-value 
was 10' 30 . All the peptide IDs that are followed by "10", "30" or "50" were included in the 
protein group when 10" 10 was used as the e-value cutoff. 

IV.A.1. CONSERVED SEQUENCES BETWEEN PROTEINS OF 

DIFFERENT SPECIES GIVE RISE TO A SIGNATURE SEQUENCE 
The signature sequence for each group of proteins, also referred to as the consensus 
sequence. The signature sequence comprises the amino acids that are conserved throughout all 
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the proteins in a particular protein group. The data are shown in the Protein Group table. 

Not all the polypeptides in a group are the same length. Thus, some members of the 
group may not contain the entire signature sequence. However, throughout the length of any 
member protein, its sequence will match the signature sequence. 

The consensus sequence contains both lower-case and upper-case letters. The upper- 
case letters represent the standard one-letter amino acid abbreviations. The lower case letters 
represent classes of amino acids: 

• "t" refers to tiny amino acids, which are specifically alanine, glycine, serine and 
threonine. 

• "p"refers to polar amino acids, which are specifically, asparagine and glutamine 

• "n" refers to negatively charged amino acids, which are specifically, aspartic acid 
and glutamic acid 

• "+" refers to positively charged residues, which are specifically, lysine, arginine, 
and histidine 

• "r" refers to aromatic residues, which are specifically, phenylalanine, tyrosine, and 
tryptophan, 

• "a" refers to aliphatic residues, which are specifically, isoleucine, valine, leucine, 
and methonine 

In addition to each consensus sequence, Applicants have generated a scoring matrix to 
provide further description of the consensus sequence. The matrix reports the identitiy and 
number of occurences of all the amino acids that were found in the group members for every 
residue position of the signature sequence. The matrix also indicates for each residue position, 
how many different organisms were found to have a polypeptide in the group that included a 
residue at the relevant position. These results are reported in the Protein Group Matrix table. 

Functional equivalents share similar (1) structural characteristics; (2) biochemical 
activities and molecular interactions; (3) cellular responses or activities; or (4) phenotypic 
effects. 




IV.A.2.LINKING SIGNATURE SEQUENCES TO CONSERVATION OF 
STRUCTURAL CHARACTERISTICS 
Proteins with related functions show similar three-dimensional structures but may not 
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show extensive amino acid sequence similarity. Typically, proteins need only share a single 
motif or low similarity in multiple domains to exhibit similar structural features, such as alpha 
helix, beta sheet, charge residues, stretches of hydrophobicity, etc. Conserved structural features 
have been implicated in ligand binding by receptor proteins, binding to a class of substrates, 
polynucleotide binding, or protein-protein interactions. 

Based on the signature sequences and the Matrix Tables described herein, a number of 
motifs can be discerned. Motifs are identified as regions in the signature sequence which are 
constant in a majority of the members of the group. Example motifs can be found among 
Applicant's data which are shared in the range of 75% to 95% of group memmbers 

Typically, a region of the consensus sequence is constant if, at each position of the 
region, the preferred amino acid is chosen from a single class of amino acids; even more 
typically, the preferred amino acid is a single amino acid. The region can contain a number of 
positions where an amino acid can be chosen. However, these variable positions are usually less 
than 15% of the total number of residues in the region; more usually, less than 10%; even more 
usually, less than 5%. 

Generally, a domain is considered to be well defined if the consensus sequence is 
constructed from sequences from at least 2 organisms; more preferably, at least 3 organisms; 
even more preferably four organisms or greater. 

Primary domains are best identified from the data presented for the 10" 10 probability 
criteria. Using this parameter, the largest number of proteins is associated into a group. 
Consequently, the signature sequence exhibits the greatest amount of variability. The conserved 
regions, the domains or motifs of the signature contrast against the variable regions. These 
variable regions become obvious when sequences from more proteins are compared. 

Signature sequences revealed in the 10~ 30 and 10" 50 e-value classes show more 
conservation in the domains, and can even display a degree of conservation in what is 
considered the variable regions in the 10* 10 analyses. These more extensively-conserved 
domains can reflect higher similarity in function - completely orthologous functions. Proteins 
that share a number of conserved domains, in the same relative order from N terminus to C 
terminus, are even more likely to be completely orthologous. Nevertheless, because of the 
natural divergence that occurs in non-conserved regions during evolution and species 
differentiation, orthologs can be proteins with only the domains conserved and therefore be 
present in the 10" 30 and 10" 10 p value classes of the Ortholog Table. 
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IV. A.l LINKING SIGNATURE SEQUENCES TO CONSERVATION OF 

BIOCHEMICAL ACTIVITIES AND MOLECULAR INTERACTIONS 
Proteins that possess the same defined domains or motifs are likely to carry out the same 
biochemical activity or interact with a similar class of target molecule, e.g., DNA, RNA, 
proteins, etc. Thus, the pFAM domains listed in the Reference Tables are routinely used as 
predictors of these properties. Substrates and products for the specific reactions can vary from 
protein to protein. Where the substrates, ligands, or other molecules bound are identical the 
affinities may differ between the proteins. Typically, the affinities exhibited by different 
functional equivalents varies no more than 50%; more typically, no more than 25%; even more 
typically, no more than 10%; or even less. 

Proteins with very similar biochemical activities or molecular interactions will share 
similar structural properties, such as substrate grooves, as well as sequence similarity in more 
than one motif Usually, the proteins will share at least two motifs of the signature sequence; 
more usually, three motifs; even more usually four motifs or greater. Typically, the proteins 
exhibit 70% sequence identity in the shared motifs; more typically, 80% sequence identity; even 
more typically, 90% sequence identity or greater. These proteins also often share sequence 
similarity in the variable regions between the constant motif regions. Further, the shared motifs 
will be in the same order from amino- to carboxyl-termini. The length of the variable regions 
between the motifs in these proteins, generally, is similar. Specifically, the number of residues 
between the shared motifs in these proteins varies by less than 25%; more usually, does not vary 
by less than 20%; even more usually, less than 15%; even more usually less than 10% or even 
less. 

IV.A.4.LINKING SIGNATURE SEQUENCES TO CONSERVATION OF 
CELLULAR RESPONSES OR ACTIVITIES 
Proteins that exhibit similar cellular response or activities will possess the structural and 
conserved domain/motifs as described in the Biochemical Activities and Molecular Interactions 
above. 

Proteins can play a larger role in cellular response than just their biochemical activities 
or molecular interactions suggest. A protein can initiate gene transcription, which is specific to 
the drought response of a cell. Other cellular responses and activities include: stress responses, 
hormonal responses, growth and differential of a cell, cell to cell interactions, etc. 

The cellular role or activities of protein can be deduced by transcriptional analyses or 
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phenotypic analyses as well as by determining the biochemical activities and molecular 
interactions of the protein. For example, transcriptional analyses can indicate that transcription 
of gene A is greatly increased during flower development. Such data would implicate protein A 
encoded by gene A, in the process of flower development. Proteins that shared sequence 
similarity in more than one motif would also act as functional equivalents for protein A during 
flower development. 



IV.A.5.LINKING SIGNATURE SEQUENCES TO CONSERVATION OF 
PHENOTYPIC EFFECTS 
Typically, proteins that are grouped together under the most stringent parameters, e- 
value <10' 50 , are likely orthologs and therefore, when present in the same or equivalent cells can 
cause similar phenotypic consequences. These proteins have very high sequence similarity. 
Typically, if one of the members of a group is an Arabidopsis protein, then the corn ortholog can 
rescue an Arabidopsis mutant plant that does not produce the Arabidopsis protein. The mutant 
plant would be rescued as the parental "wild-type" phenotype by expression of a coding 
sequence of the corn protein of the same orthologous group when present in the appropriate cell 
types of the plant. 

Preferably, these functional equivalents have sequence motif identity throughout much 
of the length of the protein. However, proteins that share very high similarity between a 
number, usually more than two; even more usually, more than three motifs can act as functional 
equivalents to produce similar phenotypic effects. 

A gene can have coding sequence similarity, i.e., is a homologous. The coding sequence 
can be sufficient to act as a functional equivalent, although the gene as a whole is not an 
ortholog. For example, two similar dwf4 coding sequences were found in the Arabidopsis 
genome. However, this pair of coding sequences had different promoters and hence different 
roles in Plantae . But when one of the pair was placed under the control of its mates' promoter, 
the phenotypic effects were similar to the effects produced by its mate coding sequence. 
Therefore, the coding sequence, but not the genes are orthologous. 
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APPLICATIONS OF THE INVENTIONS 

As described herein, the results of Applicant's experiments provide an understanding of 
the function and phenotypic implications of the genes, gene components and products of the 
present invention. Bioinformatic analysis provides such information. The sections of the present 
application containing the bioinformatic analysis, together with the Sequence and Reference 
Tables, teach those skilled in the art how to use the genes, gene components and products of the 
present invention to provide plants with novel characteristics. Similarly, differential expression 
analysis provides additional such information and the sections of the present application on that 
analysis; together with the MA_Diff Tables and MA_Cluster Tables, describe the functions of 
the genes, gene components and products of the present invention which are understood from 
the results of the differential expression experiments. The same is true with respect to the 
phenotype data, wherein the results of the Knock-in and Knock-out experiments and the sections 
of the present application on those experiments provide the skilled artisan with further 
description of the functions of the genes, gene components and products of the present 
invention. 

As a result, one reading each of these sections of the present application as an 
independent report will understand the function of the genes, gene components and products of 
the present invention. But those sections and descriptions can also be read in combination, in an 
integrated manner, to gain further insight into the functions and uses for the genes, gene 
components and products of the present invention. Such an integrated analysis does not require 
extending beyond the teachings of the present application, but rather combining and integrating 
the teachings depending upon the particular purpose of the reader. 

Some sections of the present application describe the function of genes, gene 
components and products of the present invention with reference to the type of plant tissue (e.g. 
root genes, leaf genes, etc.), while other sections describe the function of the genes, gene 
components and products with respect to responses under certain conditions (e.g. auxin- 
responsive genes, heat- responsive genes, etc.). Thus, if one desires to utilize a gene understood 
from the application to be a particular tissue-type of gene, then the condition-specific 
responsiveness of that gene can be understood from the differential expression tables, and very 
specific characteristics of actions of that gene in a transformed plant will be understood by 
recognizing the overlap or intersection of the gene functions as understood from the two 
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different types of information. Thus, for example, if one desires to transform a plant with a root 
gene for enhancing root growth and performance, one can know the useful root genes from the 
results reported in the knock-in and knock-out tables. A review of the differential expression 
data may then show that a specific root gene is also over-expressed in response to heat and 
osmotic stress. The function of that gene is then described in (1) the section of the present 
application that discusses root genes, (2) the section of the present application that discusses 
heat-responsive genes, and (3) the section of the application that discusses osmotic stress- 
responsive genes. The function(s) which are commonly described in those three sections will 
then be particularly characteristic of a plant transformed with that gene. This type of integrated 
analysis of data can be viewed from the following schematic that summarizes, for one particular 
gene, the function of that gene as understood from the phenotype and differential expression 
experiments. 



%5 



Gene function known 
from phenotype 
experiments 



Gene function known 
from first differential 
expression experiment 



Gene function known 
from second differential 
expression experiment 



25 



Function A 
Function B 



Function F 
Function G 

Function I 



Function A 

Function C 
Function D 

Function F 
Function G 



Function A 

Function C 

Function E 
Function F 

Function H 
Function I 



Function J 



30 

In the above example, one skilled in the art will understand that a plant transformed with this 
particular gene will particularly exhibit functions A and F because those are the functions which 
are understood in common from the three different experiments. 
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Similar analyses can be conducted on various genes of the present invention, by which one 
skilled in the art can effectively modulate plant functions depending upon the particular use 
or conditions envisioned for the plant. 
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I. 



ORGAN-AFFECTING GENES, GENE COMPONENTS, PRODUCTS 
(INCLUDING DIFFERENTIATION AND FUNCTION) 



LA. ROOT GENES, GENE COMPONENTS AND PRODUCTS 



LA.1. ROOT GENES, GENE COMPONENTS AND PRODUCTS 



The economic values of roots arise not only from harvested adventitious roots or 
tubers, but also from the ability of roots to funnel nutrients to support growth of all plants and 
increase their vegetative material, seeds, fruits, etc. Roots have four main functions. First, 
they anchor the plant in the soil. Second, they facilitate and regulate the molecular signals 
and molecular traffic between the plant, soil, and soil fauna. Third, the root provides a plant 
with nutrients gained from the soil or growth medium. Fourth, they condition local soil 
chemical and physical properties. 



Root genes identified herein are defined as genes, gene components and products 
capable of modulating one or more processes in or functions of the root as described below. 
They are active or potentially active to a greater extent in roots than in most other organs of 
the plant. These genes and gene products can regulate many plant traits from yield to stress 
tolerance. That single genes usually affect the development and function of roots and whole 
plants is a consequence of biological cellular complexity and the role roots play in supporting 
the growth of whole plants. Examples of such root genes and gene products are shown in the 
Reference and Sequence Reference and Sequence Tables and sequences encoding 
polypeptides of the Protein Group and Protein Group Matrix tables or fragments thereof, the 
Knock-In and Knock-Out Tables, and the MA-diff Tables. The function of many of the 
protein products gained from comparisons with proteins of known functions, are also given in 
the REF Tables. 

Root Genes Identified By Phenotypic Observations 

Root genes are active or potentially active to a greater extent in roots than in some 
other organs/tissue of the plant. Some of the root genes herein were discovered and 
characterized from a much larger set of genes in experiments designed to find genes that 



a. 



Identification of Root Genes 
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cause phenotypic changes in root morphology. Such morphological changes include primary 
and lateral root number, size and length, as well as phenotypic changes of other parts of that 
plant associated with changes in root morphology. 

In these experiments, root genes were identified by either (1) ectopic expression of a 
cDNA in a plant or (2) mutagenesis of the plant genome. The plants were then cultivated 
under standardized conditions and any phenotypic differences recorded between the modified 
plants as compared with the parent plant. The gene(s) causing the changes were deduced 
from the cDNA inserted or disrupted gene. Phenotypic differences were observed in: 



Primary Roots And Root System 

• Size, Including Length And Girth 

• Number 

• Branching 

. Root Waving/Curling Characteristics 

• Gravitropism Changes 

• Agravitropic 
Lateral Roots 

• Size, Including Length And Girth 

• Number 

. Branching 



Results from screening for these phenotypic changes are reported in the Knock-in and 
Knock-out Tables. Therefore, any sequence reported in those Tables with one of the above- 
noted observations is considered a "root gene". A "root gene" is also a sequence which, in the 
Ortholog Tables or in the MA-clust Tables, is grouped/clustered together with at least one 
sequence that is identified as such by means of the Knock-in and Knock-out Tables. 

Root Genes Identified By Differential Expression 

Root genes were also identified by measuring the relative levels of mRNA products in 
the root versus the aerial portion of a plant. Specifically, mRNA was isolated from roots and 
root tips of Arabidopsis plants and compared to mRNA isolated from the aerial portion of the 
plants utilizing microarray procedures. The MA_diff Table(s) reports the transcript levels of the 
experiment (see EXPT ID: 108594, 108433, 108599, 108434, 108439 ). For transcripts that had 
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higher levels in the samples than the control, a "+" is shown. A "-" is shown for when transcript 
levels were reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Roots genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Roots Genes Identified By Cluster Analyses Of Differential Expression 

Roots Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Roots genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108594, 108433, 108599, 108434, 108439 of the 
MA_difftable(s). 

Roots Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Roots genes. A group in the MA_clust is 
considered a Roots pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Roots Genes Identified By Amino Acid Sequence Similarity 
Roots genes from other plant species typically encode polypeptides that share amino acid 
similarity to the sequences encoded by corn and Arabidopsis Roots genes. Groups of Roots 
genes are identified in the Protein Group table. In this table, any protein group that comprises a 
peptide ID that corresponds to a cDNA ID member of a Roots pathway or network is a group of 
proteins that also exhibits Roots functions/utilities. 
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Examples of phenotypes, biochemical activities, and transcription profiles that can be 
modulated by these genes and gene products are described above and below. 

b. Identification of Root Genes 
b. Use of Root Genes to Modulate Phenotypes 
The root genes of the instant invention are capable of modulating one or more 
processes of root structure and/or function including (I) development; (II) interaction with the 
soil and soil contents; and (HI) transport in the plant. 

Root genes and gene products can be used to alter or modulate one or more of the 
following phenotypes. 

I. DEVELOPMENT 

Roots arise from meristem cells that are protected by a root cap during root 
elongation, but as the root grows out, the cap cells abscise and the remaining cells 
differentiate to the tip. Depending on the plant species, some surface cells of roots can 
develop into root hairs. Some roots persist for the life of the plant; others gradually shorten 
as the ends slowly die back; some may cease to function due to external influences. The root 
genes and gene products of this invention are useful to modulate any one or all of these 
growth and development processes generally, as in root density and root growth; including 
rate, timing, direction, size, for example. 

A. Different Types Of Roots 

Root genes and gene products are useful to modulate either the growth and 
development or other processes in one or more of the following types of roots: 

1. Primary 

2. Lateral 

3. Root Hairs (See The Section Below For More Detail) 

4. Adventitious 

B. Cell Properties 

Root genes and gene products are useful to modulate cellular changes in; 

1. Cell Size 

2. Cell Division, Rate Direction And/Or Number 
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3. Cell Elongation 

4. Cell Differentiation 

5. Lignified Cell Walls 

6. Epidermal Cells, Such As Trichoblasts 

7. Root Apical Meristem Cells (Growth And Initiation) 

C. Root Architecture 

The following parts of roots can be modulated by these genes root and gene products to 
affect root architecture: 

1. Epidermis 

2. Cortex 

(a) Epidermis 

(b) Hypodermis 

(c) Endodermis 

(d) Casparian Strips 

(e) Suberized Secondary Walls 

(f) Parenchyma 

(g) Aerenchyma 

3. Stele 

(a) Vaculature 

(i) Xylem 

(ii) Phloem 

(b) Pericycle 

4. Vasculature 

5. Xylem 

6. Phloem 

7. Root Cap 

8. Root Apical Meristem 

9. Elongating Region 

10. Symmetry 

D. Root Responses 

The polynucleotides and polypeptides of this invention can be used to control the 
responses to internal plant programs as well as to environmental stimuli in: 

1. Seminal System 



Page 81 of 772 



Attorney Dkt. i/MM^jSr 
Client Dkt. 80090.005 



2. Nodal System 

3. Hormone Systems 

(a) Auxin (For More Details See The Section On Auxin- 
Responsive Genes) 

(b) Cytokinin (Inhibitory For Root Development, See 
Section On Cytokinin-Responsive Genes) 

4. Root Cap Abscission 

5. Root Senescence 

6. Gravitropism 

7. Coordination Of Root Growth And Development With That Of 
Other Organs, Examples Include: 

(a) Leaves 

(b) Flowers 

(c) Seeds 

(d) Fruits 

(e) Stems 

8. Changes In Soil Environment (For More Detail See Below) 

(a) Water 

(b) Minerals 



(c) 
(d) 



Ph 



Microfauna And Flora 



II. INTERACTION WITH SOIL AND SOIL CONTENTS 



Roots are sites of intense chemical'and biological activities and as a result can 
strongly modify the soil they contact. Roots coat themselves with surfactants and mucilage 
to facilitate these activities. Specifically, roots are responsible for nutrient uptake by 
mobilizing and assimilating water, organic and inorganic compounds, ions and attracting and 
interacting with beneficial microfauna and flora. Roots also help to mitigate the effects of 
toxic chemicals, pathogens and stress. Examples of root properties and activities that the 
genes and gene products of this invention are useful to modulate are as follows: 



A. Root Surfactants And Mucilage 
1. Mucilage 

(a) Composition 
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(b) 



Secretion Rate And Time 



2. Surfactant 
Nutrient Uptake Of 
1. Water 

(a) Which Can Be Measured By The Supply To Shoot On 



2. Nitrate And Other Sources Of Nitrogen (For Detail See The 
Section On Nitrogen Responsive Genes) 

3. Phosphate 

4. Potassium 

5. Micronutrients (E.G. Iron, Copper, Etc,) 
Microbes And Nematodes Associations: 

1. Bacteria Including Nitrogen-Fixing Bacteria 

2. Mycorrhizae 

3 . Nodule-Forming And Other Nematodes 
Oxygen 

1. Transpiration 
Detoxification Of 

1. Iron 

2. Aluminum 

3. Cadium 

4. Mercury 

5. Salt 

6. Other Heavy Metals And Toxins 
Pathogen Interactions/Control 

1 . Chemical Repellents 

(a) Glucosinolates (GSL), Which Release Pathogen- 



The Basis Of Volume/Dry Weight Or Surface Area 



Controlling Isothiocyanates 



Changes In Soil Properties, Such As: 



1. 



Ph 



2. 



Mineral Depletion 



3. 



Rhizosheath 
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m. TRANSPORT OF MATERIALS IN PLANTS 

Uptake of nutrients by roots produces a "source-sink" effect in a plant. The greater 
the source of nutrients, the larger "sinks," such as stems, leaves, flowers, seeds, fruits, etc. 
can grow. Thus, root genes and gene products are useful to modulate the vigor and yield of 
the plant overall as well as distinct cells, organs, or tissues. Examples are as follows: 
A. Vigor 

1. Plant Nutrition 

2. Growth Rate 

• Whole Plant, Including Height, Flowering Time, Etc. 

• Seedling 
Coleoptile Elongation 

• Young Leaves 

• Stems 

• Flowers 

• Seeds 

• Fruit 

3. Yield 

(A) Biomass 

. Fresh And Dry Weight During Any Time In Plant 
Life, Including Maturation And Senescence 

(b) Root/Tuber Yield 

. Number, Size, Weight, Harvest Index 
• Content And Composition, E.G. Amino Acid, 
Jasmonate, Oil, Protein And Starch 

(c) Number Of Flowers 

(d) Seed Yield 

. Number, Size, Weight, Harvest Index 
. Content And Composition, E.G. Amino Acid, 
Jasmonate, Oil, Protein And Starch 

(e) Fruit Yield 

. Number, Size, Weight, Harvest Index, Post Harvest 
Quality 
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Content And Composition, E.G. Amino Acid, 
Jasmonate, Oil, Protein And Starch 




Additional Uses Of Plants With Modified Roots 

Plants with roots modified in one or more of the properties described above in I, II or 
m are used to provide: 

A. Higher vigor and yield of plants and harvested products due to pathogen 
resistance from conditioning the soil with plant-derived chemicals and/or more 
tolerance to stresses such as drought, flooding and anoxia. 

B. Better Animal (Including Human) Nutrition 

C. Improved Dietary Mineral Nutrition 

D. Better Plant Survival 

(a) Decreased Lodging 

(b) More Efficient Transport 

(c) More Efficient Physiology 

(d) More Efficient Metabolism 

E. Better Resistance To Plant Density Effects 

F. Increased Yield Of Valuable Molecules 

G. More Efficient Root Nodulation 

H. Better Access To Rhizobia Spray Application, For Anaerobic Soils 

I. Easier Crop Harvesting And Ground Tillage 
J. Decreased Soil Erosion 

To regulate any of the phenotype(s) above, activities of one or more of the root genes 
or gene products is modulated and tested by screening for the desired trait. Specifically, the 
gene, mRNA levels, or protein levels can be altered in a plant utilizing the procedures 
described herein and the phenotypes can be assayed. As an example, a plant can be 
transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) 
and/or screened for variants as in Winkler et al. (1998) Plant Physiol 1 18: 743-50 and 
visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Dolan et al. (1993, Development 119: 71-84), Dolan et al. (1997, Development 
124: 1789-98), Crawford and Glass (1998, Trends Plant Science 3: 389-95), Wang et al. 
(1998, PNAS USA 95: 15134-39), Gaxiola et al. (1998, PNAS USA 95: 4046-50), Apse et al. 
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(1999, Science 285: 1256-58), Fisher and Long (1992, Nature 357: 655-60), Schneider et al. 
(1998, Genes Devel 12: 2013-21) and Hirsch (1999, Curr Opin Plant Biol. 2: 320-326). 

c. Use of Root Genes to Modulate Phenotypes 

c. Use of Root Genes to Modulate Biochemical Activities 



The activities of one or more of the root genes can be modulated to change 
biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 





BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING 




AND/OR PATHWAYS 


ASSAYS 


Association Of Root 


. Cell-Cell Recognition 


Gage et al. (1996) J Bacterid 


Morphology With Nitrogen 


. Cell Wall Degradation 


178: 7159-66 


Fixing Bacteria 






Primary Root, Lateral 


. Cell Division/Elongation 


Schneider et al. (1998) Genes 


Root, And Root Hair 


• Cell Differentiation 


Devel 12: 2013-21 


• Initiation 


• Cell Expansion 


Casimiro et al. (2001). Plant 


• Spacing 


• Auxin Mediated Response 


Cell 13:843-852. 


• Elongation 


Pathways 


Roggetal. (2001). Plant 


. Branching 




Cell 13:465-480. 






Gaedekeetal. (2001). 






EMBOJ. 20:1875-1887. 






Neuteboom et al. (1999). 






Plant Mol. Biol. 39:273-287. 






Schindelman et al. (2001). 






Genes andDev. 15:1115- 






1127. 






Rashotte et al. (2001) Plant 






Cell 13:1683-1697. 






Zhang etal. (2000). J Exp 






Bot 51:51-59. 






Zhang et al. (1998) Science 






279: 407-409. 
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BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING 




AND/OR PATHWAYS 


ASSAYS 


Metabolism 


• Organic Molecule Export 


Moody et al. (1988) 






Phytochemistry 27: 2857-61. 




• Ion Export 


Uozumi et al. (2000) Plant 






Physiol 122: 1249-59 






Frachisse et al. (2000) Plant J 






21: 361-71 




. Nutrient Uptake 


Frachisse et al. (2000) Plant J 






21: 361-71 






Uozumio et al. (2000) Plant 






Physiol 122: 1249-59 






Williamson et al. (2001). 






Plant Physiol. 126:875-882. 






Zhang etal. (2000). J Exp 






Bot 51:51-59. 






Zhang etal. (1998). Science 






279:4 07-409. 






Coruzzietal. (2001). Plant 






Physiol. 125:61-64. 


Root Gravitropism And 


• Reactive Oxygen Species 


Joo et al. (2001) Plant 


Waving 


(ROS) Such As Superoxide 


Physiol. 126:1055-60. 




Anions And H202 


Vitha et al. (2000). Plant 




Production 


Physiol. 122: 453-461. 




• Auxin Transport Pathways 


Tasaka et al. (2001) Int Rev 






Cytol 206:135-54. 




Flavonoid Inhibition Of 


Brown etal. (2001) Plant 




Auxin Transport Function 


Physiol 126:524-35. 




• Changes In Root Cap Ph 


Fasanoetal. (2001) Plant 




• Starch Synthesis And 


Cell 13:907-22. 




Storage 


MacCleery et al. (1999). 




• Cell Differentiation 


Plant Physiol 120:183-92 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




• Cell Elongation 


Blancaflor et al. (1998). 
Plant Physiol 116:213-22 
Schneider et al. (1998) Genes 
Devel 12: 2013-21 



Other biological activities that can be modulated by the root genes and gene products 
are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

A. 1 .a. USE OF ROOT GENES TO MODULATE TRANSCRIPTION 
LEVELS OF PLANT GENES 
Many genes are "up regulated " or "down regulated" because they belong to networks 
or cascades of genes. Thus some root genes are capable of regulating many other gene 
activities via these networks and hence complex phenotypes. Examples of transcription 
profiles of root genes are described in the Table below with associated biological activities. 
"Up-regulated" profiles are those where the concentrations of the mRNA in total mRNA are 
higher in roots as compared to aerial parts of a plant; and vice- versa for "down-regulated" 
profiles. 
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EXAMPLES OF 


TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 




CONSEQUENCES 


ACTIVITY 


Up Regulated 


Genes Expressed In 


• Primary Root, 


• Transporters 


Transcripts 


Root Development 


Lateral Root, and/or 


• Metabolic Enzymes 






Root Hair Growth 


• Change In Cell 






and Differentiation 


Membrane Structure 






• Microorganism 


And Potential 




Responders To 


Perception 


• Kinases, 




Micro-Organismal 


• Entrapment Of 


Phosphatases, G- 




Symbionts And 


Microorganismal 


Proteins 




Parasites 


Symbionts 


• Transcription 






• Nutrient Uptake 


Activators 




Genes involved in 


• Synthesis Of 


• Change In 




polar auxin transport 


Metabolites And/Or 


Chromatin Structure 






Proteins 


And/Or Localized 






• Modulation Of 


DNA Topology 






Transduction 


. Cell Wall Proteins 






Pathways 


. Ca^ Fluctuation 






• Specific Gene 


• Reactive Oxygen 




Genes involved in 


Transcription 


Species (ROS) 




starch deposition in 


Initiation 


production 




the roots 


• Nutrient Uptake 








Enhancement 






Genes involved in 


• Gravitropic growth 






production of reactive 


of roots 






oxygen species 


• Associations with 








rhizobia are 






Genes involved in 


stimulated 






flavonoid synthesis 
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EXAMPLES OF 


TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 




CONSEQUENCES 


ACTIVITY 


Down- 


Genes Repressed In 


• Negative 


• Transcription 


Regulated 


Root Development 


Regulation Of 


Factors 


Transcripts 




Primary Root, 


• Kinases, 






Lateral Root, and/or 


Phosphatases, G- 




Responders To 


Root Hair 


Proteins 




Micro-Organismal 


Production 


• Change In 




Symbionts And 


Released 


Chromatin Structure 




Parasites 


• Changes In 


And/Or DNA 






Pathways And 


Topology 






Processes 


. Stability Of Factors 




Genes With 


Operating In Cells 


For Protein 




Discontinued 


• Changes In 


Synthesis And 




Expression Or 


Metabolism 


Degradation 




UnsTable mRNA In 


• Inhibition of root 


• Metabolic En2ymes 




Presence Of Root 


gravitropism 






And/Or Micro- 








Organismal 








Symbionts 







Changes in the function or development of roots are the result of modulation of the 
activities of one or more of these many root genes and gene products. These genes and/or 
products are responsible for effects on traits such as plant vigor and seed yield, especially when 
plants are growing in the presence of soil borne biotic or abiotic stresses or when they are 
growing in barren conditions or in soils depleted of certain minerals. 

Root genes, gene components and gene products can act alone or in combination as 
described in the introduction. Of particular interest are combinations of these genes and gene 
products with those that modulate stress tolerance and/or metabolism. Stress tolerance and 
metabolism genes and gene products are described in more detail in the sections below. 
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Use of Promoters of Root Genes 

Promoters of root genes, as described in the Reference tables, for example, can be used 
to modulate transcription that is induced by root development or any of the root biological 
processes or activities above. For example, when a selected polynucleotide sequence is operably 
linked to a promoter of a root gene, then the selected sequence is transcribed in the same or 
similar temporal, development or environmentally-specific patterns as the root gene from which 
the promoter was taken. The root promoters can also be used to activate antisense copies of any 
coding sequence to achieve down regulation of its protein product in roots. They can also be 
used to activate sense copies of mRNAs by RNA interference or sense suppression in roots. 
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LA.2. ROOT HAIR GENES, GENE COMPONENTS AND PRODUCTS 
Root hairs are specialized outgrowths of single epidermal cells termed trichoblasts. In 
many and perhaps all species of plants, the trichoblasts are regularly arranged around the 
perimeter of the root. In Arabidopsis, for example, trichoblasts tend to alternate with non- 
hair cells or atrichoblasts. This spatial patterning of the root epidermis is under genetic 
control, and a variety of mutants have been isolated in which this spacing is altered or in 
which root hairs are completely absent. 



Root hair genes identified herein are defined as genes, gene components and products 
capable of modulating one or more processes in or the function of root hairs as described below. 
Root hairs are capable of controlling or influencing many plant traits, also as shown below. 
Examples of such root hair development genes and gene products are shown in the Reference 
and Sequence Tables. The protein products of many of these genes are also identified in these 
Tables. 

Root Hair Genes Identified by Differential Expression 

These genes were discovered and characterized from a much larger set of genes by 
experiments designed to find genes whose mRNA products are associated specifically with root 
hairs. These experiments made use of the arabidopsis mutant "root hairless" (rhl), which does 
not develop root hairs. By comparing gene expression profiles of rhl roots with those of wild 
type roots grown in identical conditions, genes specifically expressed in root hairs were 
revealed. The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
108594, 108433). For transcripts that had higher levels in the samples than the control, a is 
shown. A is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Root Hairs genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Root Hairs Genes Identified By Cluster Analyses Of Differential Expression 

Root Hairs Genes Identified By Correlation To Genes That Are 
Differentially Expressed 



IDENTIFICATION OF ROOT HAIR GENES 
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As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Root Hairs genes is any group in the MA_clust that comprises 
a cDNA ID that also appears in Expt ID 108594, 108433 the MA_diff table(s). 



Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Root Hairs genes. A group in the MA_clust is 
considered a Root Hairs pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 



Root Hairs genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by com and Arabidopsis Root Hairs genes. 
Groups of Root Hairs genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Root Hairs 
pathway or network is a group of proteins that also exhibits Root Hairs functions/utilities. 

Examples of phenotypes, biochemical activities, and transcript profiles that can be 
modulated by these genes and gene products are described above and below. 



The root hair development genes of the instant invention are useful to modulate one or 
more processes of root hair structure and/or function including (I) development; (II) 
interaction with the soil and soil contents; (IE) uptake and transport in the plant; and (IV) 
interaction with microorganisms. 



Root Hairs Genes Identified By Correlation To Genes That Cause 



Physiological Consequences 



Root Hairs Genes Identified By Amino Acid Sequence Similarity 



b) USE OF ROOT HAIR DEVELOPMENT GENES TO 
MODULATE PHENOTYPES 
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I. DEVELOPMENT 

The surface cells of roots can develop into single epidermal cells termed trichoblasts 
or root hairs. Some of the root hairs will persist for the life of the plant; others will gradually 
die back; some may cease to function due to external influences. The genes and gene 
products of this invention are useful to odulate any one or all of these growth and 
development process generally, as in root hair density or root hair growth; including rate, 
timing, direction, and size, for example. A more detailed list of processes that are regulated 
by these genes and gene products is as follows: 

A. Cell Properties 

Root hair development genes and gene products modulate cellular 
changes in: 

1. Cell Size 

2. Cell Division, Rate And Direction And Number 

3. Cell Elongation 

4. Cell Differentiation 

5. Lignified Cell Walls 

6. Epidermal Cells, Such As Trichoblasts 

7. Root Apical Meristem Cells (Growth And Initiation) 

B. Root Hair Architecture 

The following parts of a root hair can be modulated to affect root hair 
architecture: 

C. Leaf Cells Under The Trichome 

D. Cells Forming The Base Of The Trichome 

E. Trichome Cells 

F. Root Hair Responses 

The genes and gene products of this invention are useful to modulate any one or all of 
the following growth and development processes in response to internal plant programs or 
environmental stimuli in: 



Seminal System 
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2. Nodal System 

3. Hormone Responses 

4. Auxin (For More Details See The Section On Auxin- 
Responsive Genes) 



7. Gravitropism 

8. Coordination Of Root Growth And Development With That Of 
Other Organs, Examples Include: 

(a) Leaves 

(b) Flowers 

(c) Seeds 

(d) Fruits 

(e) Stems 

9. Changes In Soil Environment (For More Detail See Below) 

(a) Water 

(b) Minerals 



5. 



Root Cap Abscission 



6. 



Root Senescence 



(c) 
(d) 



Ph 



Microfauna And Flora 



n. 



INTERACTION WITH SOIL AND SOIL CONTENTS 



Root hairs are sites of intense chemical and biological activity and as a result can 
strongly modify the soil they contact. Roots hairs can be coated with surfactants and 
mucilage to facilitate these activities. Specifically, roots hairs are responsible for nutrient 
uptake by mobilizing and assimilating water, reluctant ions, organic and inorganic 
compounds and chemicals. In addition, they attract and interact with beneficial microfauna 
and flora. Root hairs also help to mitigate the effects of toxic ions, pathogens and stress. 
Examples of root hair properties and activities that the genes and gene products of the 



invention are useful to modulate are as follows: 



A. 



Root Hair Surfactant And Mucilage 
1. Mucilage 

(a) Composition 



(b) 



Secretion Rate And Time 
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2. Surfactant 

B. Nutrient Uptake 

1. Water 

2. Nitrate And Other Sources Of Nitrogen(For Detail See The 
Section On Nitrogen Responsive Genes) 

3. Phosphate 

4. Potassium 

5. Micronutrients (E.G. Iron, Copper, Etc.) 

C. Microbe And Nematode Associations 

1 . Bacteria Including Nitrogen-Fixing Bacteria 

2. Mycorrhizae 

3. Nodule-Forming And Other Nematodes 

4. Nitrogen Fixation 

D. Oxygen 

I . Transpiration 

E. Detoxification Effects Of 

7. Iron 

8. Aluminum 

9. Cadium 

10. Mercury 

II. Salt 

12. Other Soil Constituents 

F. Pathogens 

1. Chemical Repellent 



(a) 



Glucosinolates (GSL), Which Release Pathogen- 
Controlling Isothiocyanates 



G. 



Changes In Soil 



1. 



Ph 



2. 



Mineral Excess And Depletion 



3. 



Rhizosheath 
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m. TRANSPORT OF MATERIALS IN PLANTS 

Introduction: Uptake of the nutrients by the root and root hairs contributes a source- 
sink effect in a plant. The greater source of nutrients, the more sinks, such as stems, leaves, 
flowers, seeds, fruits, etc. can draw sustenance to grow. Thus, root hair development genes 
and gene products are useful to modulate the vigor and yield of the plant overall as well as of 
distinct cells, organs, or tissues of a plant. Examples are as follows: 
A. Vigor 

1. Plant Nutrition 

2. Growth Rate 

(a) Whole Plant, Including Height, Flowering Time, Etc. 

(b) Seedling 

(c) Coleoptile Elongation 

(d) Young Leaves 

(e) Stems 

(f) Flowers 

(g) Seeds 

(h) Fruit 

3. Yield 

(a) Biomass 

• Fresh And Dry Weight During Any Time In Plant 
Life, Including Maturation And Senescence 

(b) Number Of Flowers 

(c) Number Of Seeds 

(d) Seed Yield 

(e) Number, Size, Weight, Harvest Index 

• Content And Composition, E.G. Amino Acid, 
Jasmonate, Oil, Protein And Starch 

(f) Fruit Yield 

• Number, Size, Weight, Harvest Index, Post Harvest 
Quality 
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Additional Uses of Plants with Modified Root Hairs 
Plants with root hairs modified in one or more of the properties described above in I, 
II or in are used to provide: 

A. Higher vigor and yield of plant and harvested products due to pathogen 
resistance from conditioning the soil with plant-derived chemicals 
and/or more tolerance to stresses such as drought, flooding and anoxia 

B. Better Animal (Including Human) Nutrition 

C. Improved Dietary Mineral Nutrition 

D. Increased Plant Survival By Decreasing Lodging 

E. Better Plant Survival By: 

(a) Decreased Lodging 

(b) More Efficient Transport 

(c) More Efficient Physiology 

(d) More Efficient Metabolism 

F. Increased Yield Of Valuable Molecules 

Root Hair Modulation 

To regulate any of the phenotype(s) above, activities of one or more of the root hair 
genes or gene products is modulated and tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels are altered in a plant utilizing the 
procedures described herein and the phenotypes can be assayed. As an example, a plant can 
be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) 
and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and 
visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Dolan et al. (1993, Development 1 19: 71-84), Dolan et al. (1997, Development 
124: 1789-98), Crawford and Glass (1998, Trends Plant Science 3: 389-95), Wang et al 
(1998, PNAS USA 95: 15134-39), Gaxiola et al. (1998, PNAS USA 95: 4046-50), Apse et al. 
(1999, Science 285: 1256-58), Fisher and Long (1992, Nature 357: 655-60), Schneider et al. 
(1998, Genes Devel 12: 2013-21) and Hirsch (1999, Curr Opin Plant Biol. 2: 320-326). 



A.2.a. USE OF ROOT HAIR DEVELOPMENT GENES TO 



MODULATE BIOCHEMICAL ACTIVITIES 
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The activities of one or more of the root hair development genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 



Process 


Biochemical Or Metabolic 
Activities And/Or Pathways 


Citations Including Assays 


Association Of Root 
Hair With Nitrogen 
Fixing Bacteria 


• Functions Associated 
With Root Hair Curling 
And Signal Transduction 


Gage et al. (1996) J 
Bacterid 178: 7159-66 


Root Hair 

• Spacing 

• Initiation 

• Elongation 




Schneider et al. (1998) 
Genes Devel 12: 2013-21 


Metabolism 


♦ Organic Molecule Export 


Moody et al. (1988) 
Phytochemistry 27: 2857-61 




• Ion Export 


Uozumi et al. (2000) Plant 
Physiol 122: 1249-59 
Frachisse et al. (2000) Plant 
J 21: 361-71 


Nutrient Uptake 


• Nutrient Uptake 


Frachisse et al. (2000) Plant 
J 21: 361-71 

Uozumio et al. (2000) Plant 
Physiol 122: 1249-59 



Other biological activities that can be modulated by the root hair genes and gene 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 



A.2.a. USE OF ROOT HAIR GENES, GENE COMPONENTS AND 
PRODUCT TO MODULATE TRANSCRIPTION LEVELS 
Many genes are "up regulated" or "down regulated" in root hairs or associated with 
root hair formation because genes are regulated in networks. Thus some root hairs genes are 
useful to regulate the activities of many other genes, directly or indirectly to influence 
complex phenotypes. Examples of transcription profiles of root genes are described in the 
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Table below with associated biological activities. "Up regulated" profiles are those where 
the mRNAlevels are higher when the rhl gene is inhibited as compared to when rhl gene is 
not inhibited; and vice- versa for "down-regulated" profiles. 
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Transcript 


Type Of Genes 


Physiological 


Examples Of 


Levels 




Consequences 


Biochemical Activity 


Down 


• Genes Expressed In 


• Root Hair Formation 


• Transporters 


Regulated 


Root Hair 


• Microorganism 


• Metabolic Enzymes 


Transcripts 


Development 


Perception 


• Change In Cell 






• Entrapment Of 


Membrane Structure 




Responders To 


Microorganismal 


And Potential 




Micro-Organismal 


Symbionts 


• Kinases, 




Symbionts And 


• Nutrient Uptake 


Phosphatases, G- 




Parasites 


• Synthesis Of 


Proteins 






Metabolites And/Or 


• Transcription 






Proteins 


Activators 






• Modulation Of 


• Change In Chromatin 






Transduction 


Structure And/Or 






Pathways 


Localized DNA 






• Specific Gene 


Topology 






Transcription 


. Cell Wall Proteins 






Initiation 








• Nutrient Uptake 








Enhancement 




Up-Regulated 


• Genes Repressed In 


• Negative Regulation 


• Transcription Factors 


Transcripts 


Roots Making 


Of Hair Production 


• Kinases, 




Hairs 


Released 


Phosphatases, G- 






• Changes In 


Proteins 






Pathways And 


• Change In Chromatin 




• Responders To 


Processes Operating 


Structure And/Or 




Micro-Organismal 


In Cells 


DNA Topology 




Symbionts And 


• Changes In 


. Stability Of Factors 




Parasites 


Metabolism 


For Protein Synthesis 








And Degradation 
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Transcript 


Type Of Genes 


Physiological 


Examples Of 


Levels 




Consequences 


Biochemical Activity 








• Metabolic Enzymes 




. Genes With 




. Cell Wall Proteins 




Discontinued 








Expression Or 








UnsTable mRNAIn 








Presence Of Root 








Hairs And/Or 








Micro-Organismal 








Symbionts 







□ Changes in the patterning or development of root hairs are the result of modulation of the 

I % activities of one or more of these many root hair genes and gene products. These genes and/or 

CP products are responsible for effects on traits such as plant vigor and seed yield, especially when 

h5 plants are growing in the presence of biotic or abiotic stresses or when they are growing in 

^ barren conditions or in soils depleted of certain minerals. 

41 Root hair genes and gene products can act alone or in combination as described in the 

p introduction. Of particular interest are combination of these genes and gene products with those 

that modulate stress tolerance and/or metabolism. Stress tolerance and metabolism genes and 

1 0 gene products are described in more detail in the sections below. 

Use of Promoters of Root Hair Genes 

Promoters of root hair development genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by root hair development or any of 
15 the following phenotypes or biological activities above. For example, any desired sequence can 
be transcribed in similar temporal, tissue, or environmentally-specific patterns as the root hair 
genes when the desired sequence is operably linked to a promoter of a root hair responsive gene. 
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LB. LEAF GENES, GENE COMPONENTS AND PRODUCTS 

LB.l. LEAF GENES, GENE COMPONENTS AND PRODUCTS 
Leaves are responsible for producing most of the fixed carbon in a plant and are critical 
to plant productivity and survival. Great variability in leaf shapes and sizes is observed in nature. 
Leaves also exhibit varying degrees of complexity, ranging from simple to multi-compound. 
Leaf genes as defined here, not only modulate morphology, but also influence the shoot apical 
meristem, thereby affecting leaf arrangement on the shoot, internodes, nodes, axillary buds, 
photosynthetic capacity, carbon fixation, photorespiration and starch synthesis. Leaf genes 
elucidated here can be used to modify a number of traits of economic interest from leaf shape to 
plant yield, including stress tolerance, and to modify the efficiency of synthesis and 
accumulation of specific metabolites and macromolecules. 

B. 1 .a. IDENTIFICATION OF LEAF GENE, GENE COMPONENTS 

AND PRODUCTS 

Leaf genes identified herein are defined as genes, active or potentially active to 
greater extent in leaves than in some other organs of the plant or as genes that affect leaf 
properties. These genes and gene components are useful for modulating one or more 
processes in or functions of leaves, as described below, to improve plant traits ranging from 
yield to stress tolerance. Examples of such leaf genes and gene products are shown in the 
Reference and Sequence Tables and sequences encoding polypeptides of the Protein Group 
and Protein Group Matrix tables or fragments thereof, Knock-In, Knock-Out and MA_diff 
Tables. The biochemical functions of the protein products of many of these genes 
determined from comparisons with known proteins are also given in the Reference tables. 

Leaf Genes Identified by Phenotypic Observations 

Some leaf genes were discovered and characterized from a much larger set of genes 
by experiments designed to find genes that cause phenotypic changes in leaf, petiole, 
internode, and cotyledon morphology. 

In these experiments, leaf genes were identified by either (1) ectopic expression of a 
cDNA in a plant or (2) mutagenesis of the plant genome. The plants were then cultivated and 
one or more of the following leaf phenotypes, which varied from the parental "wild-type", 
were observed: 
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Changes In Seedling Stage Cotyledons 

• Cup Shaped 
. Curled 

• Horizontally Oblong 

• Long Petioles 

• Short Petioles 

• Silver 
. Tricot 
. Wilted 

Changes In Rosette And Flowering Stage Leaf Shapes 

• Cordate 

• Cup-Shaped 
. Curled 

• Fused 

• Lanceolate 

• Lobed 

• Long Petioles 

• Short Petioles 
. Oval 

• Ovate 

• Serrate 
. Trident 

• Undulate 

• Vertically Oblong 

Changes In Cauline, Flowering Leaf Shape 

• Misshapen 
. Other 

Changes In Leaf Pigment 

• Albino 

• Dark Green Pigment 

• High Anthocyanin 

• Interveinal Chlorosis 
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• Yellow Pigment 

E. Changes In Leaf Size 

F. Changes In Seedling Stage Hypocotyl 

• Long 
. Short 

G. Changes In Leaf Number 

H. Changes In Wax Deposition 

• Glossy Rosette And Flowering Stage Leaves 

• Altered Wax Deposition On The Bolt 



Leaf Genes Identified by Differential Expression 

Also, leaf genes were identified in experiments in which the concentration of mRNA 
products in the leaf, or stem, or Knock-out mutant 3642-1 were compared with to a control. The 
MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 108477, 108512, 
108497, 108498, 108598). For transcripts that had higher levels in the samples than the control, 
a "+" is shown. A "-" is shown for when transcript levels were reduced in root tips as compared 
to the control. For more experimental detail see the Example section below. 

Leaf genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Leaf Genes Identified By Cluster Analyses Of Differential Expression 

Leaf Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Leaf genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ED 108477, 108512, 108497, 108498, 108598 of the 
MA_difftable(s). 
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Leaf Genes Identified By Correlation To Genes That Cause 



Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Leaf genes. A group in the MA_clust is considered 
a Leaf pathway or network if the group comprises a cDNA ID that also appears in Knock-in or 
Knock-put tables that causes one or more of the phenotypes described in section above. 



Leaf genes from other plant species typically encode polypeptides that share amino acid 
similarity to the sequences encoded by corn and Arabidopsis Leaf genes. Groups of Leaf genes 
are identified in the Protein Group table. In this table, any protein group that comprises a 
peptide ID that corresponds to a cDNA ID member of a Leaf pathway or network is a group of 
proteins that also exhibits Leaf functions/utilities. 

It is assumed that (i) the genes preferentially expressed in leaves are concerned with 
specifying leaf structures and the synthesis of all the constituent molecules and (ii) that the genes 
repressed in leaves specify products that are not required in leaves or that could inhibit normal 
leaf development and function. 

Examples of phenotypes, biochemical activities, and transcription profiles that are 
modulated by using selected members of these genes and gene products, singly or in 
combination, are described below. 



Leaves are critical for the performance and industrial utility of plants. There is 
extensive evidence that the number, size, shape, position, timing of synthesis, timing of 
senescence and chemical constitution are very important for agriculture, horticulture and uses 
of plants as chemical factories for making valuable molecules. Many improvements already 
demonstrated over past decades have involved genetic modifications to leaves. Therefore, 
the leaf genes and gene components of this invention offer considerable opportunities for 
further improving plants for industrial purposes. When the leaf genes and/or gene 
components are mutated or regulated differently, they are capable of modulating one or more 
of the processes determining leaf structure and/or function including (I) development; (II) 
interaction with the environment and (HI) photosynthesis and metabolism. 



Leaf Genes Identified By Amino Acid Sequence Similarity 



B.l.b. USE OF LEAF GENES, GENES COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 
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I. DEVELOPMENT 

The leaf genes, gene components and products of the instant invention are useful to 
modulate one or more processes of the stages of leaf morphogenesis including: stage 1- 
organogenesis that gives rise to the leaf primordium; stage 2- delimiting basic morphological 
domains; and stage 3- a coordinated processes of cell division, expansion, and differentiation. 
Leaf genes include those genes that terminate as well as initiate leaf development. 
Modulating any or all of the processes leads to beneficial effects either at specific locations or 
throughout the plant. 



A. Gene Sequences Affecting Types of Leaves 

Applicants provide with these genes, gene components and gene products the means 
to modulate one or more of the following types of leaves, shoots, and stems: 

1. Cotyledons 

2. Major Leaves 

3. Cauline Leaves 

4. Petioles 

B. Gene Sequences Affecting Cell properties 

Leaf genes, gene components and gene products are useful to modulate changes in: 

1. Cell Size 

2. Cell Division, Rate And Direction 

3. Cell Elongation 

4. Cell Differentiation 

5. Stomata Size, Number, Spacing And Activity 

6. Trichome Size And Number (For More Details See Section On 
Trichome Genes) 

7. Xylem And Phloem Cell Numbers 

8. Cell Wall Composition 

9. All Cell Types 

C. Gene Sequences Affecting Leaf Architecture: 

The following properties of a leaf are useful to modulate to change overall leaf 
architecture: 
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1 . Veination - Improvements in photosynthetic efficiency, stress 
tolerance efficiency of solute and nutrient movement to and 
from the leaf are accomplished by increases or decreases in: 

(a) Vein Placement 

(b) Number Of Cells In The Vein 

2. Shape 

(a) Elongated Versus Rounded 

(b) Symmetry, Around Either 

• Abaxial-Adaxial (Dorsiventral) Axis 

• Apical-Basal (Proximodistal) Axis 

• Margin-Blade-Midrib (Lateral) Axis 

3. Branching - Improved plant performance to biotic and abiotic 
stress in heavy density planting is achieved by increases or 
decreases in: 

(a) Leaf branch position 

(b) Leaf branch length 



G. Genes Sequences Influencing Leaf Responses 
Shoot apical meristem cells differentiate to become leaf primordia that eventually 
develop into leaves. The genes, gene components and gene products of this invention are 
useful to modulate any one or all of these growth and development processes, by affecting 
timing and rate or planes of cell divisions for example, in response to the internal plant 
stimuli and/or programs listed below: 

1. Embryogenesis 

2. Germination 

3. Hormones 

(a) Auxin (For More Details See The Section On Auxin- 
Responsive Genes) 

4. Leaf Senescence 

5. Phototropism 

6. Coordination Of Leaf Growth And Development With That Of 
Other Organs 

(a) Roots 
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(b) Flowers 

(c) Seeds 

(d) Fruits 

(e) Stems 

7. Stress-Related Programs 



H. INTERACTION WITH THE ENVIRONMENT 

Leaves are the main sites of photosynthesis and have various adaptations for that 
purpose. Flat laminae provide a large surface for absorbing sunlight; leaves are rich in 
chloroplasts and mitochondria; stomata in the lower surface of the laminae allow gases to 
pass into and out of the leaves including water; and an extensive network of veins brings 
water and minerals into the leaves and transports the sugar products produced by 
photosynthesis to the rest of the plant. Examples of leaf properties or activities that are 
modulated by leaf genes, gene components and their products to facilitate interactions 
between a plant and the environment include: 



A. Pigment Accumulation (See The Section On Viability Genes For More 
Detail) 

B. Wax accumulation on the surface of leaves - Improved protection of 
young leaves from water bome pathogen attack such as Downey 
Mildew with increased wax production. 

C. Oxygen Gain/Loss Control 

D. Carbon Dioxide Gain/Loss Control 

E. Water Gain/Loss Control 

F. Nutrient Transport 

G. Light Harvesting 

H. Chloroplast Biogenesis 

I. Circadian Rhythm Control 
J. Light/Dark Adaptation 

K. Defense Systems Against Biotic And Abiotic Stresses 
L. Metabolite Accumulation 

M. Secondary metabolite production in leaf mesophyl, epidermis and 
trichomes. 
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(a) Increases in antifeeding secondary metabolites such as 
strictosiden reduce herbivory. 

(b) Decreases in secondary metabolites improve plants as forage by 
reducing allergens or undigestible compounds. 



m. PHOTOSYNTHESIS AND METABOLISM 

Many of the uses for plants depend on the success of leaves as the powerhouses for 
plant growth, their ability to withstand stresses and their chemical composition. Leaves are 
organs with many different cell types and structures. Most genes of a plant are active in 
leaves and therefore leaves have very diverse of pathways and physiological processes. 
Examples of such pathways and processes that are modulated by leaf genes, gene components 
and products include: 



A. 


Photosynthesis 


B. 


Sugar Metabolism 


C. 


Starch Synthesis 


D. 


Starch Degradation 


E. 


Nitrate And Ammonia Metabolism 


F. 


Amino Acid Biosynthesis, Transport 


G. 


Protein Biosynthesis 


H. 


DNA Replication, Repair 


I. 


Lipid Biosynthesis And Breakdown 


J. 


Protein Biosynthesis, Storage And Breakdown 


K. 


Nucleotide Transport And Metabolism 


L. 


Cell Envelope Biogenesis 


M. 


Membrane Formation 


N. 


Mitochondrial And Chloroplast Biogenesis 


O. 


Transcription And RNA Metabolism 


P. 


Vitamin Biosynthesis 


Q- 


Steroid And Terpenoid Biosynthesis 


R. 


Devise Secondary Metabolite Synthesis 


S. 


Co-Enzyme Metabolism 


T. 


Flavonoid Biosynthesis And Degradation 


U. 


Synthesis Of Waxes 
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V. Glyoxylate Metabolism 

W. Hormone Perception And Response Pathways 



Uses of Plants that Are Modified as Described Above 

Altering leaf genes or gene products in a plant modifies one or more of the following 
plant traits, to make the plants more useful for specific purposes in agriculture, horticulture 
and for the production of valuable molecules. The useful plants have at least one of the 
following: 

A. A higher yield of leaves and their molecular constituents due to 
different: 

1 . Number, Size, Weight, Harvest Index 

2. Composition Including And Amounts And Types Of 
Carbohydrates, Proteins, Oils, Waxes, Etc. 

3. Photosynthetic Efficiency E.G. Reduced 
Photorespiration 

4. Absorption Of Water And Nutrients To Enhance Yields, 
Including Under Stresses Such As High Light, 
Herbicides, And Heat. 

5. Pathways To Accumulate New Valuable Molecules. 

B. More optimal leaf shape and architecture - enhancing photosynthesis 
and enhancing appeal in ornamental species 

(a) Size 

(b) Number 

(c) Pigment 

(d) Aroma 

C. A better overall plant architecture - enhancing photosynthesis and 
enhancing appeal in ornamental species 

(a) Petals 

(b) Sepals 

(c) Stamens 

(d) Carpels 
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Better shade avoidance for maximizing photosynthesis by, for 

example, altering leaf placement, to improve light capture and 

photosynthetic efficiency, thereby increasing yields 

Reduced negative effects of high planting density, by altering leaf 

placement to be more vertical instead of parallel to the ground, for 

instance 

More resistance to the deleterious effects of wind and mechanical 
damage. 

Better stress tolerance, including without limitation 

1 . Drought resistance, by decreasing water loss, for example 

2. Pathogen resistance, including, for instance, 



Better overall yield and vigor 

Plant yield of biomass and of constituent molecules and plant vigor are 
modulated to create benefits by genetically changing: 

1. Growth Rate Of 

(a) Whole Plant, Including Height, Flowering Time, Etc. 

(b) Seedling 

(c) Coleoptile Elongation 

(d) Young Leaves 

(e) Flowers 

(f) Seeds 

(g) Fruit 

2. Biomass 

(a) Fresh And Dry Weight During Any Time In Plant Life, 
Including Maturation And Senescence 

(b) Number Of Flowers 

(c) Seed Yield Including, For Example, 

• Number, Size, Weight, Harvest Index 

• Content And Composition, E.G. Amino Acid, 



Insect resistance through internal insecticide levels and 
optimizing the leaf shape to prevent runoff of 



insecticides 



Jasmonate, Oil, Protein And Starch 
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(d) Fruit Yield 

. Number, Size, Weight, Harvest Index 
. Content And Composition, E.G. Amino Acid, 
Jasmonate, Oil, Protein And Starch 



To change any of the phenotype(s) in I, II, or EI above, activities of one or more of 
the leaf genes or gene products are modulated in an organism and the consequence evaluated 
by screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels are 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(Methods. Mol. Biol. 82:259-266 (1998)) with leaf gene constructs and/or screened for 
variants as in Winkler et al., Plant Physiol. 118: 743-50 (1998) and visually inspected for the 
desired phenotype and metabolically and/or functionally assayed for altered levels of relevant 
molecules. 



B. 1 .c. USE OF LEAF GENES, GENE COMPONENTS AND PRODUCTS 

TO MODULATE BIOCHEMICAL ACTIVITIES 
Leaves are complex organs and their structure, function and properties result from the 
integration of many processes and biochemical activities. Some of these are known from the 
published literature and some can be deduced from the genes and their products described in 
this application. Leaf genes, and gene components are used singly or in combination to 
modify these processes and biochemical activities and hence modify the phenotypic and trait 
characteristics described above. Examples of the processes and metabolic activities are given 
in the Table below. The resulting changes are measured according to the citations included in 
the Table. 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 


Metabolism - anabolic 
and catabolic 


. Farnesylation 

. Cell Wall Biosynthesis 

• Nitrogen Metabolism 

• Secondary Metabolite 


Pei et al., Science 282: 287- 
290 (1998); Cutler etal., 
Science 273: 1239(1996) 
Goupil et al., J Exptl. Botany 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS 




METABOLIC ACTIVITIES 


INCLUDING 




AND/OR PATHWAYS 


ASSAYS 




Biosynthesis and 


49:1855-62(1998) 




Degradation 


Walch-Liu et al., J Exppt 






Botany 51, 227-237 (2000) 


Water Conservation And 


• Stomatal Development And 


Allen et al., Plant Cell 11: 


Resistance To Drought 


Physiology 


1785-1798 (1999) 


And Other Related 


• Production of polyols 


Li et al., Science 287: 300- 


Stresses 


• Regulation of salt 


303 (2000) 




concentration 


Burnett et al., J Exptl. Botany 




• ABA response(s) 


51: 197-205 (2000) 






Raschke, In: Stomatal 






Function, Zeiger et al. Eds., 




• Ca2+ Accumulation 


253-279 (1987) 




• K+ Fluxes 




Transport Anion and 


• Na+ Fluxes 


Lacombe et al., Plant Cell 12: 


Cation Fluxes 


• Receptor - ligand binding 


837-51 (2000); 




• Anion and Cation fluxes 


Wang et al., Plant Physiol. 






118:1421-1429(1998); 






Shi et al., Plant Cell 11:2393- 






2406 (1999) 






Gaymard et al., Cell 94:647- 






655 (1998) 






Jonak et al., Proc. Natl. Acad. 






Sci. 93: 11274-79(1996); 






Sheen, Proc. Natl. Acad. Sci. 






95: 975-80 (1998); 






Allen et al., Plant Cell 11: 






1785-98 (1999) 


Carbon Fixation 


• Calvin Cycle 


Wingler et al., Philo Trans R 




Photorespiration 


Soe LondBBiol Sci 355, 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 




- Oxygen evolution 


1517-1529 (2000); 




- RuBisCO 






• Chlorophyll metabolism 


Palecanda et al., Plant Mol 




• Chloroplast Biogenesis and 


Biol 46, 89-97 (2001); 




Metabolism 


Baker et al., J Exp Bot 52, 




• Fatty Acid and Lipid 


615-621 (2001) 




Biosynthesis 






• Glyoxylate metabolism 


Chen et al., Acta Biochim Pol 






41, 447-457(1999) 




• Sugar Transport 


Imlau et al., PlantCell H, 309- 




• Starch Biosynthesis and 


322(1999) 




Degradation 




Hormone Perception and 


• Hormone Receptors and 




Growth 


Downstream Pathways for 






- ethylene 


Tieman et al., Plant J 26, 47- 




- jasmonic acid 


58 (2001) 




- brassinosteroid 


Hilpert et al., Plant J 26, 435- 




- gibberellin 


446 (2001) 




- auxin 


Wenzel et al., Plant Phys 






124, 813-822 (2000) 




cytokinin 


Dengler and Kang, Curr Opin 






Plant Biol 4, 50-56 (2001) 




• Activation Of Specific 


Tantikaniana et al., Genes 




Kinases And Phosphatases 


Devl5, 1577-1580(2001) 



Other biological activities that are modulated by the leaf genes and gene products are 
listed in the Reference tables. Assays for detecting such biological activities are described in 
the Protein Domain table, for example. 
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B.l.d. USE OF LEAF GENES, GENE COMPONENTS AND PRODUCTS 

TO MODULATE TRANSCRIPTION LEVELS 
The expression of many genes is "upregulated" or downregulated" in leaves because 
some leaf genes and their products are integrated into complex networks that regulate 
transcription of many other genes. Some leaf genes, gene components and products are 
therefore useful for modifying the transcription of other genes and hence complex 
phenotypes, as described above. Profiles of leaf gene activities are described in the Table 
below with associated biological activities. "Up-regulated" profiles are those where the 
mRNA transcript levels are higher in leaves as compared to the plant as a whole. "Down- 
regulated" profiles represent higher transcript levels in the whole plant as compared to leaf 
tissue only. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES OF 


BIOCHEMICAL 




TRANSCRIPTS 


MODIFYING GENE 


ACTIVITIES OF GENE 




ARE CHANGED 


PRODUCT LEVELS 


PRODUCTS WITH 








MODIFIED LEVELS 


Up Regulated 


Genes Involved In 


• Leaf Cells 


• Transcription 


Transcripts 


Leaf Cell 


Proliferate And 


Factors, Signal 




Differentiation, Cell 


Differentiate; 


Transduction 




Division, Cell 




Proteins, Kinase 




Expansion 




And Phosphatases 




Genes Involved In 


• Leaf Structures 


• Chromatin 




Positive Regulation 


Form And Expand 


Remodeling 




Of Leaf Genes 




• Hormone 








Biosynthesis 




Repressors Of Root 




Enzymes 




And Other Non Leaf 




• Receptors 




Cell Types 










• Photosynthesis And 


• Light Harvesting 






Plastid 


Coupled To ATP 






Differentiation 


Production 








• Chlorophyll 








Biosynthesis 




Genes Involved In 


• Calvin Cycle 


• Ribulose 




Photosynthesis 


Activated 


Bisphosphate 






• Chloroplast 


Carboxylase 






Biogenesis And 


• Chloroplast 






Plastid 


Membranes 






Differentiation 


Synthesis 






Activated 


• Chloroplast 








Ribosome 








Biogenesis 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES OF 


BIOCHEMICAL 




TRANSCRIPTS 


MODIFYING GENE 


ACTIVITIES OF GENE 




ARE CHANGED 


PRODUCT LEVELS 


PRODUCTS WITH 








MODIFIED LEVELS 






• Starch Biosynthesis 


• Starch Synthase 






• Lipid Biosynthesis 








• Nitrogen 


• Nitrate Reductase 






Metabolism - NO3 


• Terpenoid 




Other Genes 


Reduced And 


Biosynthesis 




Involved In 


Amino Acids Made 


• Transcription 




Metabolism 


• Secondary 


Factors 






Metabolites 


• Transporters 






Produced 


• Kinases 








Phosphatases And 








Signal Transduction 








Protein 








• Chromatin Structure 








Modulators 






• Leaf Genes 


• Transcription 






Activated And Leaf 


Factors 






Functions Induced; 


• Signal Transduction 








Proteins - Kinases 




Genes Involved In 




And Phosphatases 


Down 


Negative Regulation 


• Dark-Adapted 


• Metabolic Enzymes 


Regulated 


Of Leaf Genes 


Metabolism 




Genes 




Suppressed 








• Meristematic Genes 


♦ Chromatin 






Suppressed 


Remodeling Proteins 






• Leaf Metabolic 








Pathways Induced 
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While leaf polynucleotides and gene products are used singly, combinations of these 
polynucleotides are often better to optimize new growth and development patterns. Useful 
combinations include different leaf polynucleotides and/or gene products with a hormone 
responsive polynucleotide. These combinations are useful because of the interactions that 
exist between hormone-regulated pathways, nutritional pathways and development. 

Use of Leaf Gene Promoters 

Promoters of leaf genes are useful for transcription of desired polynucleotides, both 
plant and non-plant. If the leaf gene is expressed only in leaves, or specifically in certain 
kinds of leaf cells, the promoter is used to drive the synthesis of proteins specifically in those 
cells. For example, extra copies of carbohydrate transporter cDNAs operably linked to a leaf 
gene promoter and inserted into a plant increase the "sink" strength of leaves. Similarly, leaf 
promoters are used to drive transcription of metabolic enzymes that alter the oil, starch, 
protein, or fiber contents of a leaf. Alternatively, leaf promoters direct expression of non- 
plant genes that can, for instance, confer insect resistance specifically to a leaf. Additionally 
the promoters are used to synthesize an antisense mRNA copy of a gene to inactivate the 
normal gene expression into protein. The promoters are used to drive synthesis of sense 
RNAs to inactivate protein production via RNA interference. 
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I.B.2. TRICHOME GENES AND GENE COMPONENTS 
Trichomes, defined as hair-like structures that extend from the epidermis of aerial 
tissues, are present on the surface of most terrestrial plants. Plant trichomes display a diverse 
set of structures, and many plants contain several types of trichomes on a single leaf. The 
5 presence of trichomes can increase the boundary layer thickness between the epidermal tissue 
and the environment, and can reduce heat and water loss. In many species, trichomes are 
thought to protect the plant against insect or pathogen attack, either by secreting chemical 
components or by physically limiting insect access to or mobility on vegetative tissues. The 
stellate trichomes of Arabidopsis do not have a secretory anatomy, but at a functional level, 
10 they might limit herbivore access to the leaf in the field. In addition, trichomes are known to 
secrete economically valuable substances, such as menthol in mint plants. 

B.2.a. IDENTIFICATION OF TRICHOME GENES, GENE 

C COMPONENTS AND PRODUCTS 



Trichome genes identified herein are defined as genes or gene components capable of 

% modulating one or more processes in or functions of a trichome, as described below. These 

FU genes, their components and products are useful for modulating diverse plant traits from 

ill 

Cfi production of secondary metabolites to pathogen resistance. Examples of such trichome 

%h genes and gene products are shown in the Reference and Sequence Tables and sequences 

SiO encoding polypeptides of the Protein Group and Protein Group Matrix tables or fragments 

f! i 

£ thereof, Knock-in, Knock-out, MA-diff and MA-clust. The biochemical functions of the 
~ protein products of many of these genes determined from comparisons with known proteins 
are also given in the Reference tables. 

25 Trichome Genes Identified by Phenotypic Observation 

Trichome genes were discovered and characterized from a much larger set of genes 
by experiments designed to find genes that cause phenotypic changes in trichome number and 
morphology on leaf, internode, cotyledon, petiole, and inflorescence. In these experiments, 
trichome genes were identified by either (1) ectopic expression of a cDNA in a plant or (2) 
30 mutagenesis of the plant genome. The plants were then cultivated and one or more of the 
following phenotypes, which varied from parental "wild-type", were observed: (1) trichome 
number; (2) trichome spacing (clustering); or (3) trichome branching. The genes regulating 
« trichome phenotypes are identified in the Knock-In and Kncok-Out Tables. 
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Trichome Genes Identified by Differential Expression 
Trichome genes were also discovered and characterized from a much larger set of 
genes by experiments designed to find genes whose mRNA products are associated 
5 specifically or preferentially with trichomes. These experiments made use of an Arabidopsis 
glaborous mutant and a hairy mutant. By comparing gene expression profiles of the glabrous 
mutant with those of the hairy mutant grown under identical conditions, genes specifically or 
preferentially expressed in trichomes were revealed. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: 108452). For transcripts that had higher 
10 levels in the samples than the control, a "+" is shown. A "-" is shown for when transcript 

levels were reduced in root tips as compared to the control. For more experimental detail see 
the Example section below. 

Trichome genes are those sequences that showed differential expression as compared 

0 to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
%jj5 indication 

fU Trichome Genes Identified By Cluster Analyses Of Differential Expression 

1 f i 

J*] Trichome Genes Identified By Correlation To Genes That Are 

L Differentially Expressed 

mo As described above, the transcription profiles of genes that act together are well 

correlated. Applicants not only have identified the genes that are differentially expressed in 
^ the microarray experiments, but also have identified the genes that act in concert with them. 

The MA_clust table indicates groups of genes that have well correlated transcription profiles 

and therefore participate in the same pathway or network. 
25 A pathway or network of Trichome genes is any group in the MA_clust that 

comprises a cDNA ID that also appears in Expt ID 108452 of the MA_diff table(s). 

Trichome Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 
30 Additionally, the differential expression data and the phenotypic observations can be 

merged to identify pathways or networks of Trichome genes. A group in the MA_clust is 
considered a Trichome pathway or network if the group comprises a cDNA ID that also 
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appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 



Trichome Genes Identified By Amino Acid Sequence Similarity 
Trichome genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by com and Arabidopsis Trichome genes. 
Groups of Trichome genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Trichome 
pathway or network is a group of proteins that also exhibits Trichome functions/utilities. 

It is assumed that the genes differentially expressed in trichomes or leaves producing 
trichomes are concerned with specifying trichomes and their functions and therefore 
modulations of such genes and their products modify trichomes and their products. 

Examples of phenotypes, biochemical activities, and transcription profiles that can be 
modulated by selected numbers of these genes and gene products singly or in combinations 
are described above and below. 



B.2.b. USE OF TRICHOME GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 
Trichome genes of the instant invention, when mutated or activated differently, are 
useful for modulating one or more processes of trichome structure and/or function including: 
(I) development; (II) plant stress tolerance; and (III) biosynthesis or secretion of trichome- 
specific molecules. Trichome genes, components and gene products are useful to alter or 
modulate one or more of the following phenotypes: 
I. Development 

Trichome differentiation is integrated with leaf development, hormone levels and the 
vegetative development phase. The first trichome at the leaf tip appears only after the leaf 
grows to -100 jam in length. Subsequent events proceed basipetally as the leaf grows. As 
leaf development progresses, cell division patterns become less regular and islands of 
dividing cells can be observed among differentiated pavement cells with their characteristic 
lobed morphology. Trichome initiation in the expanding leaf occurs within these islands of 
cells and often defines points along the perimeter of a circle, with an existing trichome 
defining the center. 
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Once a cell enters the trichome pathway it undergoes an elaborate morphogenesis 
program that has been divided into different stages based on specific morphological 
hallmarks. 

Selected members of the genes, gene components and gene products of this invention 
are useful to modulate any one or all of these growth and development processes by affecting 
rate, timing, direction and size, for example. Trichome genes can also affect trichome 
number and the organs on which they occur. The following can be modulated by these 
genes: 

A. Types of Trichomes 

Applicants provide the means to modulate one or more of the following types of 
trichomes with the genes and gene products of this invention: 

1 . Glandular trichomes 

2. Stellate trichomes 

B. Cell properties 

Trichome genes and gene products are useful to modulate cellular changes in: 

1 . Cell size 

2. Cell division rate and direction 

3. Cell elongation 

4. Cell differentiation 

5. Secretory cells 

6. Trichome number (Average trichome number per leaf for 
mint:13,500,000). 

7. Cell walls 

8. Cell death 

9. Response to reactive oxygen species 

C. Trichome Architecture 

The following parts or arrangement of trichomes on leaves can be modulated to affect 
trichome and/or leaf architecture: 

1 . Trichome cell structure 
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2. 



Placement on leaf 



3. 



Secretory systems 



D. Trichome responses 

Selected members of trichome genes, gene components and gene products of this 
invention are useful to modulate any one or all of the growth and development processes 
above; as in timing and rate, for example. In addition, the polynucleotides and polypeptides 
of the invention can control the response of these processes to internal plant programs and 
signaling molecules such as: 



2. Hormones: 

(a) abscisic acid (for more details see the section on abscisic acid- 
responsive genes) 

(b) auxin (for more details see the section on auxin-responsive 
genes) 

(c) cytokinin (for more details see the section on cytokinin- 
responsive genes) 

(d) gibberellins (for more details see the section on gibberellin- 
responsive genes) 

(e) brassinosteroids (for more details see the section on 



3. Apoptosis 

4. Coordinated trichome growth and development in: 

(a) Flowers 

(b) Stems 

(c) Petioles 

(d) Cotyledons 

(e) Hypocotyls 

II. Stress Tolerance 

The physical characteristics of trichomes as well as the substances secreted by 
trichomes are useful in protecting the plant from both biotic and abiotic attacks. Thus, 
selected trichome genes and gene products can be used to help protect distinct cells, organs, 



1. 



Leaf development 



brassinosteroid-responsive genes) 
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or tissues as well as overall plant yield and vigor. Examples of stresses, tolerances to which 
are modulated by trichome genes and gene products, are as follows: 

A. Drought, e.g., trichome number variation can decrease the surface area 
that allows evaporation 

B. Heat, e.g., trichomes can produce shade and provide protection for 
meristems 

C. Salt 

D. Insects, e.g., trichomes can prevent insects from settling on plant 
surfaces 

E. Herbivory, e.g., trichomes can produce harmful chemicals 

F. Ultraviolet light 

EI. Biosynthesis, Accumulation or Secretion of Metabolites 
The glandular trichomes from various species are shown to secrete and, sometimes, 
locally synthesize a number of substances including salt, monoterpenes and sesquiterpenes, 
terpenoids, exudate, insect entrapping substances, antifeedants, pheromones, and others. 
Therefore, trichome genes can be used to modulate the synthesis, accumulation and secretion 
of a large number of metabolites especially related to trichome biology. Some are 
synthesized in response to biotic and abiotic stresses. For a more detailed description of these 
metabolites see the section "Use of Trichome Genes to Modulate Biochemical Activities" 
below. 

Uses of Plants that Are Modified as Described Above 

Altering trichome properties is useful for modifying one or more of the following 
plant traits making the plants more useful in agriculture, horticulture and for the production 
of valuable molecules. 

A. Production of specific carbohydrates, proteins, oils, aromas, flavors, pigments, 
secondary metabolites such as menthol (and other monoterpenes), etc., that 
can be used in situ or purified and used in a wide variety of industries. 

B. Increased production of molecules synthesized in trichomes by increasing the 
trichome number on different plant organs, such as cotyledons, leaves, 



hypocotyls, stems, petioles, etc. 
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C. Increased cotton fibers per boll due to decreased numbers of trichomes that 
reduces insect hiding and contamination 

D. More optimal growth rate of a whole plant or specific parts of a plant due to 
more optimal trichome cellular development and the better resistance to biotic 
/abiotic stresses. Examples of specific plant parts include, without limitation, 

1. Whole plant 

2. Seedling 

3. Coleoptile elongation 

4. Young leaves 

5. Flowers 

6. Seeds 

7. Fruit 

E. Increased harvested yield of plants, organs and their constituent molecules 
1. Biomass 



(a) Fresh and dry weight during any time in plant life, including 
maturation and senescence 

(b) Number of flowers 

(c) Seed yield 

- Number, size, weight, harvest index 

- Content and composition, e.g. amino acid, jasmonate, oil, 



To regulate any of the phenotype(s) above, activities of one or more of the trichome 
genes or gene products can be modulated in an organism and tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an example, 
a plant can be transformed according to Bechtold and Pelletier (Methods. Mol. Biol. 82:259- 
266 (1998)) and/or screened for variants as in Winkler et al., Plant Physiol. 118: 743-50 



(d) 



protein and starch 
Fruit yield 

- Number, size, weight, harvest index, post harvest quality 

- Content and composition, e.g. amino acid, jasmonate, oil, 



protein and starch 
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(1998) and visually inspected for the desired phenotype or metabolically and/or functionally 
assayed. 
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B.2.c. USE OF TRICHOME GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The phenotype traits outlined above result from the integration of many cellular 
5 trichome associated processes and biochemical activities. Some of these are known from 
published literature and some can be deduced from the genes discovered in the MA Tables, 
etc. One or more of these trichome genes, gene components and products are useful to 
modulate these cellular processes, biochemical or metabolic activities and/or pathways such 
as those noted below. Such biological activities can be measured according to the citations 
10 included in the Table below: 





PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 






METABOLIC 


ASSAYS 


if": 




ACTIVITIES AND/OR 




t=r 
«. 




PATHWAYS 






Growth , 


• Cell wall biosynthetic 


Molhoj et. al. (2001). Plant 


f; : 

i pi 


Differentiation 


enzymes 


Mol.Biol. 46, 263-275 


hi \ 

01 


And Development 


• Cell fate determination 


Krishnakumar and 


j— -.; 




proteins 


Oppenheimer (1999). 


rV : : 




• Major pathways of carbon 


Development 1221, 3079- 






and nitrogen metabolism 


3088. 








Kroumova et al. (1994). 








PNAS91, 11437-11441 




Water 


• Cytoskeleton and Trichome 


Schnittgeret al. (1999). 




Conservation And 


morphology and spacing 


Plant Cell 11, 1105-1116 




Resistance To 


controls 


Hulskamp et al (1994). Cell 




Drought And 




76, 555-566 




Other Related 








Stresses 








Trichome exudate 


• Insect repellant 


Insects and The Plant 








Surface, pp 151-172, 








Edward Arnold, London 








(1986) 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC 


ASSAYS 




ACTIVITIES AND/OR 






PATHWAYS 




Terpenoid 


• Terpenoid biosynthesis 


Alonso et al. (1992). J. Biol. 


biosynthesis 


enzymes including: 


Chem. 267, 7582-7587 


including 


• Farnesyltranstransferase 


Rajonarivony et al (1992). 


monoterpenes and 


• Geranylgeranyl- 


Arch. Biochem. Biophys. 


sesquiterpenes 


diphosphate synthase 


299, 77-82 




• Geranyltranstransferase 






• Farnesyl-diphosphate 






synthase 






♦ Dimethylallyltranstransf 






erase 






• Geranyl-diphosphate 






synthase 




H 2 0 2 


• NADPH oxidase (subunit) 


Alverez et al (1998) Cell 92, 


accumulation and 


synthesis and function 


773-784 


activation of S AR 




Grant Orozco-Cardenas and 






Ryan (1999) PNAS 96, 






6553-6557 


Antifeedants 


• Lactone biosynthesis 


Paruch et al. (2000). J. 


biosynthesis and 


enzymes 


Agric. Food Chem. 48, 


secretion 




4973-4977 


Pheromone 


• Farnesine biosynthesis 


Teal et al. (1999) Arch. 


biosynthesis and 


enzymes 


Insect Biochem Physiol. 42, 


secretion 




225-232 


Endoreplication 


• Cyclin and cyclin dependant 


De Veylder et al. (2001) 




kinases 


Plant Cell 13, 1653-1668 






De Veylder et al. (2001) 






Plant J. 25,617-626 
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Specific enzyme and other activities associated with the functions of individual 
trichome genes that can be modulated by the trichome genes and gene products are listed in 
the Reference tables where the functions of individual genes and their products are listed. 
Assays for detecting such biological activities are described in the Protein Domain table, for 
example. 

B.2.d. USE OF TRICHOME GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES BY MODULATING 
TRANSCRIPTION LEVELS OF OTHER GENES 
Many of the genes are "up regulated" or "down regulated" in trichomes because they 
are regulated as members of networks or cascade of genes under the control of regulatory 
genes. Thus some trichome genes are useful to influence levels of other genes and so 
orchestrate the complex phenotypes. Examples of the types of genes with altered transcript 
levels in trichomes are described in the Table below, together with associated biological 
activities. "Up-regulated" profiles are those where the mRNA levels are higher in the 
glaborous plants as compared to the"hairy" plant. "Down-regulated" profiles represent higher 
transcript levels in the "hairy" plant as compared to the glaborous plant. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 


Up Regulated 


Genes active in 


• Changes in 


• Transcription 


Transcripts 


suppressing trichome 


Hormone 


Factors 




formation 


Perception 


• Transporters 






• Changes in 


. Change In Cell G- 






Hormone 


proteins 






Biosynthesis 


• Kinases And 






• Changes in 


Phosphatases 






Specific Gene 


• Transcription 






Transcription 


factors 






Initiation 


• Ca-binding proteins 






• Changes in 


• Transcription 






cytoskeleton and 


Activators 






cell wall 


• Change In 






assembly and 


Chromatin 






structure 


Structure And/Or 








Localized DNA 








Topology 








• Specific Factors 








(Initiation And 








Elongation) For 








Protein Synthesis 








• Maintenance Of 








mRNA Stability 








• Maintenance Of 








Protein Stability 








• Maintenance Of 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 








Protein-Protein 








Interaction 


Down-Regulated 


Genes active in 


• Changes in 


• Transcription 


Transcripts 


inducing formation of 


Hormone 


Factors 




trichomes 


Perception 


• Change In Protein 






• Changes in 


Structure By 






Hormone 


Phosphorylation 






Biosynthesis 


(Kinases) Or 






• Changes in 


Dephosphorylation 






Specific Gene 


(Phosphatases) 






Transcription 


• Change In 






Initiation 


Chromatin 






• Changes in 


Structure And/Or 






cytoskeleton and 


DNA Topology 






cell wall 


• G-proteins, Ca2+- 






assembly and 


binding proteins 






structure 






Genes associated with 








Trichome 


• Changes in cell 






differentiation and 


size, cell shape 






structure 








Genes associated with 


• Changes in 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 
WHOSE 

TRANSCRIPTS ARE 
CHANGED 


PHYSIOLOGICAL 
CONSEQUENCES 
OF MODIFYING 
GENE PRODUCT 
LEVELS 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES WHOSE 
TRANSCRIPTS ARE 
CHANGED 




trichome-specific 
metabolic pathways 


terpenoid 
biosynthesis 
• Changes in 
antifeedant and 
pheromone 
biosynthesis 





While trichome polynucleotides and gene products can act alone, combinations of 
these polynucleotides also affect growth, development and leaf biochemistry. Combinations 
of trichome polynucleotide(s) and/or gene product(s) with genes or gene products involved in 
leaf development, hormone responses, or vegetative development are useful because trichome 
development is integrated with these processes. 



Use of Promoters of Trichome Genes 

Promoters of trichome genes are useful for facilitating transcription of desired 
polynucleotides, both plant and non-plant in trichomes. For example, extra copies of existing 
terpenoid synthesis coding sequences can be operably linked to a trichome gene promoter and 
inserted into a plant to increase the terpenoids in the trichome. Alternatively, trichome 
promoters can direct expression of non-plant genes or genes from another plant species that 
can, for instance, lead to new terpenoids being made. The promoters can also be operably 
linked to antisense copies of coding sequences to achieve down regulation of these gene 
products in cells. 
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I.B.3. CHLOROPLAST GENES, GENE COMPONENTS AND 
PRODUCTS 



The chloroplast is a complex and specialized organelle in plant cells. Its complexity 
5 comes from the fact that it has at least six suborganellar compartments subdivided by double- 
membrane envelope and internal thylakoid membranes. It is specialized to carry out different 
biologically important processes including photosynthesis and amino acid and fatty acid 
biosynthesis. The biogenesis and development of chloroplast from its progenitor (the 
proplasptid) and the conversion of one form of plastid to another (e.g., from chloroplast to 
10 amyloplast) depends on several factors that include the developmental and physiological 
states of the cells. 

One of the contributing problems that complicate the biogenesis of chloroplast is the 
fact that some, if not most, of its components must come from the outside of the organelle 
D itself. The import mechanisms must take into account to what part within the different sub- 
Jb5 compartments the proteins are being targeted; hence the proteins being imported from the 
S cytoplasm must be able to cross the different internal membrane barriers before they can 
reach their destinations. The import mechanism must also take into account how to tightly 

I n 

jp coordinate the interaction between the plastid and the nucleus such that both nuclear and 

plastidic components are expressed in a synchronous and orchestrated manner. Changes in 

CSO the developmental and physiological conditions within or surrounding plant cells can 

consequently change this tight coordination and therefore change how import mechanisms 
are regulated as well. Manipulation of these conditions and modulation of expression of the 
import components and their function can have critical and global consequences to the 
development of the plant and to several biochemical pathways occurring outside the 

25 chloroplast. Expression patterns of such genes have been determined using microarray 
technology. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
30 Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results 
from such microarray experiments conducted with AFGC chips containing about 10,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from 
mRNA of different tissues and developmental stages. 
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The sequences of the ESTs showing at least two- fold increases or decreases in a 
mutant in a mutant (CiA2) of Arabidopsis thaliana, that is distributed in chloroplast 
biogenesis relative to wild type grown in the same conditions were identified, compared to 
the Ceres full length cDNA and genomic sequence databanks, and equivalent Ceres clones 
identified. The MA_diff table reports the results of this analysis, indicating those Ceres 
clones which are up or down regulated over controls, thereby indicating the Ceres clones that 
are involved in the import of proteins to chloroplast and chloroplast biogenesis. 
Examples of genes and gene products that are involved in the import of proteins to 
chloroplast are shown in the Reference, Sequence, Protein Group, and Protein Group Matrix 
tables. While chloroplast protein import polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different chloroplast protein import responsive polynucleotides and/or 
gene products that have similar transcription profiles or similar biological activities, and 
members of the same or functionally related biochemical pathways. Whole pathways or 
segments of pathways are controlled by transcription factor proteins and proteins controlling 
the activity of signal transduction pathways. Manipulation of one or more chloroplast protein 
import gene activities are useful to modulate the biological processes and/or phenotypes 
listed below. Chloroplast protein import responsive genes and gene products can act alone or 
in combination. Useful combinations include chloroplast protein import responsive genes 
and/or gene products with similar transcription profiles, similar biological activities, or 
members of the same or functionally related biochemical pathways. Here, in addition to 
polynucleotides having similar transcription profiles and/or biological activities, useful 
combinations include polynucleotides that may have different transcription profiles but which 
participate in common or overlapping pathways. Whole pathways or segments of pathways 
are controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. Manipulation of one or more 
chloroplast protein import gene activities are useful to modulate the biological processes 
and/or phenotypes listed below. 

Such chloroplast protein import responsive genes and gene products can function to 
either increase or dampen the above phenotypes or activities in response to changes in the 
regulation of import mechanisms. Further, promoters of chloroplast protein transport 
responsive genes, as described in the Reference tables, for example, are useful to modulate 
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transcription that is induced by chloroplast protein transport or any of the following 
phenotypes or biological activities below. Further, any desired sequence can be transcribed in 
similar temporal, tissue, or environmentally specific patterns as the chloroplast protein 
transport responsive genes when the desired sequence is operably linked to a promoter of a 
chloroplast protein transport responsive gene. The MAdiff Table(s) reports the transcript 
levels of the experiment (see EXPT ID: Chloroplast (relating to SMD 8093, SMD 8094)). 
For transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is 
shown for when transcript levels were reduced in root tips as compared to the control. For 
more experimental detail see the Example section below. 

Chloroplast genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 



Chloroplast Genes Identified By Cluster Analyses Of Differential Expression 

Chloroplast Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of Chloroplast genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Chloroplast (relating to SMD 8093, SMD 
8094) of the MA_difftable(s). 

Chloroplast Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Chloroplast genes. A group in the MA_clust is 
considered a Chloroplast pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 
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Chloroplast Genes Identified By Amino Acid Sequence Similarity 
Chloroplast genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Chloroplast genes. 
Groups of Chloroplast genes are identified in the Protein Group table. In this table, any 
protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
Chloroplast pathway or network is a group of proteins that also exhibits Chloroplast 
functions/utilities. 



B.3.a. USE OF CHLOROPLAST PROTEIN IMPORT RESPONSIVE 
GENES TO MODULATE PHENOTYPES 
Chloroplast protein import responsive genes and gene products are useful to or 
modulate one or more of the following phenotypes: 
- Growth 

- Roots 

- Stems 

- Leaves 

- Development 

- Plastid Biogenesis 

- Plastid Division 

- Plastid Development 

- Thylakoid Membrane Structures 

- Differentiation 

- Plastid/Chloroplast Differentiation 

- Photosynthesis 

- Carbon Dioxide Fixation 

- Transport 

- Transcription/Translation Regulation Within Transport Complex 

- Phosphate Translocation 

- Targeted Starch Deposition And Accumulation 
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- Biosynthesis Of Essential Compounds 

- Lipid Biosynthesis 

- Riboflavin Biosynthesis 

- Carotenoid Biosynthesis 

- Aminoacid Biosynthesis 

To improve any of the phenotype(s) above, activities of one or more of the chloroplast 
protein import responsive genes or gene products can be modulated and the plants tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(1998, Methods. Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. 
(1998) Plant Physiol 118: 743-50 and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Saito et al. (1994, Plant Physiol. 106: 
887-95), Takahashi et al (1997, Proc. Natl. Acad. Sci. USA 94: 1 1 102-07) and Koprivova et 
al. (2000, Plant Physiol. 122: 737-46). 

B.3.b. USE OF CHLOROPLAST PROTEIN IMPORT-RESPONSIVE GENES TO 
MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the chloroplast protein import responsive genes can 
be modulated to change biochemical or metabolic activities and/or pathways such as those 
noted below. Such biological activities can be measured according to the citations included 
in the Table below: 



GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 


Cell Growth and 
Differentiation 


• Regulation of Leaf 
Development Including 
Photosynthetic 
Apparatus 


Reinbothe et al. (1997) Proc. 
Natl. Acad. Sci. USA. 
94:8890-8894 

Eggink and Hoober (2000) J. 
Biol. Chem. 275:9087-9090 
Jagtap et al. (1998) J Exptl 
Botany 49:1715-1721 
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GENERAL CATEGORY 



BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 



CITATIONS 

INCLUDING 

ASSAYS 



Regulation of Plastid 
Biogenesis and Plastid 
Division 



Lawrence and Kindle (1997) 
J. Biol. Chem. 272:20357- 
20363 

Lahiri and Allison (2000) 
Plant Physiol. 123:883-894 



Development of Plastid 
Inner/Outer and 
thylakoid Membrane 
Structures 



Kouranov et al. (1999) J. 
Biol. Chem. 274:25181- 
25186 

Jackson et al. (1998) J. Biol. 
Chem. 273:16583-16588 
Li and Chen (1997) J. Biol. 
Chem. 272:10968-10974 
Lawrence and Kindle (1997) 
J. Biol. Chem. 272:20357- 
20363 

Silva-Filhoetal.(1997)J. 
Biol. Chem. 272:15264- 
15269 



Regulation of 
transcription and/or 
translation related to 
maintenance of stability 
of protein-protein 
interaction within 
transport complex 



May and Soli (2000) Plant 
Cell 12:53-63 

Caliebe et al. (1997) EMBO 
J. 16:7342-7350 



Physiology 



Modulation of 
Photosynthesis 



Sung and Krieg (1979) Plant 
Physiol 64: 852-56 



Regulation of Lipid 
Biosynthesis 



Bourgisetal. (1999) Plant 
Physiol. 120:913-922 
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GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 






Reverdatto et al. (1999) 
Plant Physiol. 119:961-978 
Roesler et al. (1997) Plant 
Physiol. 113:75-81 




• Regulation of Riboflavin 
(Vitamin B) biosynthesis 


Jordan etal. (1999) J. Biol. 
Chem. 274:22114-22121 




• Regulation of phosphate 
translocation across 
chloroplast membrane 


Flugge (1999) Annu. Rev. 
Plant Physiol. Plant Mol. 
Biol. 50:27-45 

Silva-Filho et al. 

(1997) J. Biol. Chem. 

272:15264-15269 




• Regulation of targeted 
starch depostion and 
accumulation 


Yu etal. (1998) Plant 
Physiol. 116:1451-1460 




. Modulation of protein 
targeting and 
translocation across 
chloroplast membrane 


Summer and Cline (1999) 
Plant Physiol. 119:575-584 
Dabney-Smith et al. (1999) 
J. Biol. Chem. 274:32351- 
32359 

Hinnah et al. (1997) EMBO 
J. 16:7351-7360 




• Regulation of carotenoid 
biosynthesis 


Bonk et al. (1996) Plant 
Physiol. 111:931-939 




• Regulation of amino acid 
biosynthesis 


Flugge (1999) Annu. Rev. 
Plant Physiol. Plant Mol. 
Biol. 50:27-45 




• Regulation of secondary 


Flugge (1999) Annu. Rev. 
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BIOCHEMICAL OR 






GENERAL CATEGORY 


METABOLIC ACTIVITIES 


CITATIONS 






AND/OR PATHWAYS 


INCLUDING 








ASSAYS 






metabolism 


Plant Physiol. Plant Mol. 








Biol. 50:27-45 




Signal Transduction 


• Regulation of gene> 


Chen et al. (2000) Plant 






transcriptional activity 


Physiol. 122:813-822. 






specific to chloroplast 


Macasev et al. (2000) Plant 






protein import 


Physiol. 123:811-816 






• Regulation of protein 


Lang et al. (1998) J. Biol. 


c.5 : 




target signal cleavage 


Chem. 273:30973-30978 


■asr 




and protein degradation 


Jackson et al. (1998) J. Biol. 


5 : 






Chem. 273:16583-16588 








Richter and Lamppa (1998) 








Proc. Natl. Acad. Sci. USA. 








95:7463-7468 


m 

~3 : 




• Regulation of ion 


Van der Wijngaard and 


Hr 
tP 




channel conformation 


Vredenberg (1999) J. Biol. 


3.-;- 




and activity 


Chem. 274:25201-25204 






• Regulation of kinase and 


Waegemann and Soli (1996) 






phosphatases synthesis 


J. Biol. Chem. 271:6545- 






and activity 


6554 








Li et al. (2000) Science 287- 








300-303 








Muller et al. (2000) J. Biol. 








Chem. 275:19475-19481 






• Modulation of Molecular 


Bonk et al. (1996) Plant 






Chaperone and Other 


Physiol. 111:931-939 






Protein Folding Activity 


Walker et al. (1996) J. Biol. 








Chem. 271:4082-4085 
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GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 






Kessler and Blobel (1996). 
Proc. Natl. Acad. Sci. USA 
93:7684-7689 

Jackson et al. (1998) J. Biol. 
Chem. 273:16583-16588 



Other biological activities that can be modulated by the chloroplast protein import 
responsive genes and gene products are listed in the Reference tables. Assays for detecting 
such biological activities are described in the Protein Domain table. 

Chloroplast protein import responsive genes are characteristically differentially 
transcribed in response to fluctuating chloroplast protein import levels or concentrations, 
whether internal or external to an organism or cell. The MA diff reports the changes in 
transcript levels of various chloroplast protein import responsive genes that are differentially 
expressed among the mutants and the wild type. 

Profiles of some of these chloroplast protein import responsive genes are shown in the 
Table below together with examples of the kinds of associated biological activities. 



TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 


Up regulated 


Responders to 


• Chloroplast 


• Transporters 


transcripts 


defective chloroplast 


protein import 


• Metabolic enzymes 




protein import 


regulation 


• Change in cell 






• Chloroplast 


membrane structure 






protein import and 


and potential 




Genes induced by 


transport 


• Kinases and 




defective import 


• Chloroplast 


phosphatases 






import 


• Transcription 






metabolism 


activators 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 






• Synthesis of 


• Change in 






secondary 


chromatin structure 






metabolites and/or 


and/or localized 






proteins 


DNA topology 






• Modulation of 


• Redox control 






chloroplast import 


• Metabolic enzymes 






response 


concerned with 






transduction 


chloroplast 






pathways 


biochemistry 






• Changes in 


• Organelle gene 






chloroplast 


expression and 






membranes 


translation 






• Specific gene 








transcription 








initiation 








• Chloroplast and 








non-chloroplast 








metabolic 








pathways 




Down-regulated 


Responders to 


• Regulation of 


• Transcription 


transcripts 


defective chloroplast 


chloroplast protein 


factors 




protein import. 


import pathways 


• Change in protein 






released 


structure by 




Genes repressed by 


• Chloroplast 


phosphorylation 




defective chloroplast 


protein import and 


(kinases) or 




protein import 


transport 


dephosphoryaltion 






• Chloroplast 


(phosphatases) 




Genes with unsTable 


import 


• Change in 




mRNAs when 


metabolism 


chromatin structure 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 




chloroplast import is 


• Changes in 


and/or DNA 




defective 


pathways and 


topology 






processes 


• Stability factors for 




Genes with 


operating in 


protein mRNA 




discontinued 


chloroplasts 


synthesis and 




expression or 


• Changes in 


degradation 




unsTable mRNA in 


organelle 


• Organelle 




presence of 


membranes 


transcription and 




chloroplast protein 


• Loss of organelle 


translation proteins 




import 


gene expression, 


• Metabolic enzymes 






RNA and protein 








synthesis 








• Changes in 








metabolism other 








than chloroplast 








protein import 








pathways 








• Chloroplast 








import 








metabolism 





Use of Promoters of Chloroplast Genes 

Promoters of Chloroplast genes are useful for transcription of any desired polynucleotide 
or plant or non-plant origin. Further, any desired sequence can be 'transcribed in a similar 
5 temporal, tissue, or environmentally specific patterns as the Chloroplast genes where the desired 
sequence is operably linked to a promoter of a Chloroplast gene. The protein product of such a 
polynucleotide is usually synthesized in the same cells, in response to the same stimuli as the 
protein product of the gene from which the promoter was derived. Such promoter are also 
useful to produce antisense mRNAs to down-regulate the product of proteins, or to produce 
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sense mRNAs to down-regulate mRNAs via sense suppression 



f~ 



ill 
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I.C. REPRODUCTION GENES, GENE COMPONENTS AND PRODUCTS 




I.C. 1 . REPRODUCTION GENES, GENE COMPONENTS AND 
5 PRODUCTS 

Reproduction genes are defined as genes or components of genes capable of modulating 
any aspect of sexual reproduction from flowering time and inflorescence development to 
fertilization and finally seed and fruit development. These genes are of great economic interest 
as well as biological importance. The fruit and vegeTable industry grosses over $1 billion USD 
1 0 a year. The seed market, valued at approximately $ 1 5 billion USD annually, is even more 
lucrative. 

Expression of many reproduction genes and gene products is orchestrated by internal 
programs or the surrounding environment of a plant, as described below. These genes and/or 

0 products have great importance in determining traits such as fruit and seed yield. Examples 
d£5 of such reproduction genes and gene products are shown in the Reference, Sequence, Protein 
PC Group, Protein Group Matrix tables, Knock-in, Knock-out, MA-diff and MA-clust. The 

\z biochemical functions of the protein products of many of these genes determined from 

01 comparisons with known proteins are also given in the Reference tables. 

S 

i : 

CpO Reproduction Genes Identified by Phenotypic Observation 

Jl Reproduction genes were discovered and characterized from a much larger set of 

rf genes by experiments designed to find genes that cause phenotypic changes in flower, silique, 
and seed morphology. In these experiments, reproduction genes were identified by either (1) 
ectopic expression of a cDNA in a plant or (2) mutagenesis of the plant genome. The plants 
25 were then cultivated and phenotypes, which varied from the parental <6 wild-type", were 
observed. 

One particular example of reproductive genes are those that are regulated by AP2. 
AP2 is a transcription factor that regulates many genes, both as a repressor of some genes and 
as an activator of others. Some of these genes are those which establish the floral meristem or 
30 those which regulate floral organ identity and development. As such, AP2 has an effect on 
reproduction. This is, loss of AP2 activity is correlated with decreased male and female 
reproduction. AP2 is also known to have an effect on seed mass, and therefore on yield. That 
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is, overexpression of AP2 is correlated with smaller seeds or seedless fruit while repression 
of AP2 correlates with larger seeds (see, e.g. US Patent No.: 5,994,622). 



Another example of reproduction genes are those that are regulated by 



PISTILLATA (PI). PI is a transcription factor that regulates many genes both as a repressor 
and activator. Some of these genes are those which regulate floral organ identity and 
development, in conjunction with other transcription factors such as AP2 and AGAMOUS. 
As such, PI has an effect on reproduction in that loss of PI activity is correlated with 
decreased male reproduction. PI is also known to have an effect on carpel number, and 
therefore potentially on ovule/seed number and yield. Specifically, repression of PI results in 
increased carpel number and therefore ovule number. 

Yet another example of reproductive genes are those that are regulated by MEDEA 
(MEA). MEA is a SET-domain containing protein that associates with other proteins to form 
complexes that affect chromatin structure and therefore gene expression. As such, loss of 
MEA function is correlated with global gene activation and repression leading to many 
phenotypes including decreased female reproduction and therefore reduced seed set and 
yield. 



In the characterization of these and other reproduction genes, the following 



- Size 

- Large 

- Small 

- Shape 

- Abnormal organ numbers 

- Agamous 



- AP-2 like 

- Color 

- Number 

- Fused Sepals 
II. Silique 

Size 



phenotypes were scored: 



I. 



Flower 



Seed number 



- Reduced 



Page 147 of 772 



Attorney UKt. 27^0-14831' 
Client Dkt. 80090.005 

- Absent 
Seed color 

The identified genes regulating reproduction are identified in the Knock-in and 
Knock-out Tables. 



Reproduction Genes Identified by Differential Expression 

Reproduction genes were also identified in experiments designed to discover genes 
whose mRNA products were in different concentrations in whole flowers, flower parts, and 
siliques relative to the plant as a whole. The MA diff Table(s) reports the transcript levels of 
the experiment (see EXPT ID: 108473, 108474, 108429, 108430, 108431, 108475, 108476, 
108501). For transcripts that had higher levels in the samples than the control, a "+" is 
shown. A "-" is shown for when transcript levels were reduced in root tips as compared to 
the control. For more experimental detail see the Example section below. 

Reproduction genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 



Reproduction Genes Identified By Cluster Analyses Of Differential Expression 
Reproduction Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Reproduction genes is any group in the MA_clust 
that comprises a cDNA ID that also appears in Expt ID 108473, 108474, 108429, 108430, 
108431, 108475, 108476, 108501 of the MA_diff table(s). 

Reproduction Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Reproduction genes. A group in the 
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MA_clust is considered a Reproduction pathway or network if the group comprises a cDNA 
ID that also appears in Knock-in or Knock-out tables that causes one or more of the 
phenotypes described in section above. 



Reproduction Genes Identified By Amino Acid Sequence Similarity 

Reproduction genes from other plant species typically encode polypeptides 
that share amino acid similarity to the sequences encoded by corn and Arabidopsis 
Reproduction genes. Groups of Reproduction genes are identified in the Protein Group table. 
In this table, any protein group that comprises a peptide ID that corresponds to a cDNA ED 
member of a Reproduction pathway or network is a group of proteins that also exhibits 
Reproduction functions/utilities. 

It is assumed that the reproduction genes differentially expressed in floral parts and 
seeds are concerned with specifying flowers and seeds and their functions, and therefore 
modulations of such genes produce variant flowers and seeds. 

Reproductive genes and gene products can function to either increase or dampen the 
phenotypes, biochemical activities and transcription profiles, either in response to changes of 
internal plant programs or to external environmental fluctuations. 

C.l.a. USE OF REPRODUCTION GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 

The reproduction genes of the instant invention, when mutated or activated 
differently, are capable of modulating one or more processes of flower, seed and fruit 
development. They are thus useful for improving plants for agriculture and 
horticulture and for providing seeds with a better chemical composition for diverse 
industries including the food, feed and chemical industries. Reproduction genes, gene 
components and products are useful to alter the following traits and properties of 
plants: 

I. Development 

A. Flowering time and number of inflorescences 

B. Flower development 

(a) Anther 

(b) Stamen 
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(c) 


Pollen 


(d) 


Style 


(e) 


Stigma 


(0 


Ovary 


(g) 


Ovule 


00 


Gametes 



C. Pollination and Fertilization 

(a) Sporogenesis 

(b) Gametogenesis 

(c) Zygote formation 

(d) Embryo development 

(e) Endosperm development 

(f) Male sterility, hybrid breeding systems and heterosis 



II. Cellular Properties: 

A. Cell size 

B. Cell shape 

C. Cell death 

D. Cell division 

E. Cell elongation 

F. Cell differentiation 

G. Meiosis 

III. Organ Characteristics: 
A. Flowers 

(1) Receptacle 

(2) Sepals, Petals, and Tepals 

a. Color 

b. Shape 



c. 



Size 



d. 



Number 



e. 



Petal Drop 



(3) 



Androecium 
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Stamen 

(i) Anther 

1. Size 

2. Pollen 

a. Sterile 

b. Size, shape, weight, color 

3. Number 

(ii) Filament 
1. Size 

Gynoecium 
a. 



b. 



Carpel 




(i) 


Ovary 


(ii) 


Number 


(iii) 


Length 


Style 




(i) 


Stigma 


00 


Ovule 



Size, shape, number 
Pedicel and Peduncle 

a. Size 

b. Shape 



Placenta 

Embryo 

Cotyledon 

Endosperm 

Suspensor 

Seed coat (testa) 

Aleurone 

Development, including Apomixis (gametophytic, 
apospory, diplospory) 

Dormancy 
Germination 
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C. Fruits 

1 . Pericarp - thickness, texture 

(a) Exocarp 

(b) Mesocarp 

(c) Endocarp 

2. Development 

(a) Seed set 

(b) Fruit set 

(c) False fruit 

(d) Fruit elongation and maturation 

(e) Dehiscence 

3. Fruit drop 

IV. Plant Seed Yield 

1 . Increased biomass 

2. Better Harvest Index 

3 . Attraction of favorable insects 

4. Better seed quality 

5. Better yield of constituent chemicals 

V. Plant Population Features: 
A. Architecture 

1. Shade avoidance 

2. Planting density 



To regulate any of the phenotype(s) above, activities of one or more of the 
reproduction genes or gene products can be modulated in an organism and tested by 
screening for the desired trait. Specifically, the gene, mRNA levels or protein levels can be 
altered in a plant using the procedures described herein and the phenotypes can be assayed. 
As an example, a plant can be transformed according to Bechtold and Pelletier (Methods 
Mol. Biol . 82:259-266 (1998)) and/or screened for variants as in Winkler et al. (Plant Physiol 
1 18:743-50 (1998)) and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed. 



Page 152 of 772 




Attorney Diet Z7MM4S31' 
Client Dkt. 80090.005 

C.l.b. USE OF REPRODUCTION GENES TO MODULATE BIOCHEMICAL 
ACTIVITIES 

The activities of one or more of the reproduction genes can be modulated to change 
biochemical or metabolic activities and/or pathways such as those examples noted below. 
Such biological activities can be measured according to the citations included in the Table 
below: 



FUNCTION/PROCESS 


EXAMPLES OF 
BIOCHEMICAL/MOLECU 
LAR ACTIVITIES 


Reference AND ASSAY 


Metabolism 


Energy production and 


Ap Rees, T. (1974). In 




conversion 


"Plant Biochemistry. 




- Glucosyl-transferase 


Biochemistry, Series One", 




(CLONEJD 1040) 


Vol. 11. (H.L. Romberg and 




- Heme-binding protein 


D.CPhillips, eds.), 




(putative cytochrom 


Butterworths, London. 




B5) 


Juliano, B.O. and Varner, 




(CLONEJD 3743) 


J.E.(1969). Plant Physiol. 






44, 886-892. 




Storage protein synthesis 


Bewley et al. (1993). Plant 






Physiol. Biochem. 31, 483- 




Inorganic ion transport and 


490. 




metabolism 


Hills, M. J. and Beevers, H. 




- Peroxidase 


(1987). Plant Physiol. 84, 




(CLONE JD 100990) 


272-276. 




- Cystathione beta 


Olsen, L. J. and Harada, J. J. 




synthase 


(1991). In "Molecular 




(CLONEJD 21847) 


Approaches to 






Compartmentalization and 




Amino acid transport and 


Metabolic Regulation (A. H. 




metabolism 


C. Huang and L. Taiz, eds.), 




- 1-asparaginase 


ASPP, Rockville, Md. 




(CLONEJD 92780) 


Mitsuhashi, W. and Oaks, 
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- Putative peptide/amino 


A. (1994). Plant Physiol. 




acid 


104, 401-407. 




transporter 


Walker-Smith, D. J., and 




(CLONEJD 113723) 


Payne, J. W. (1985). Planta 






164, 550-556. 




Carbohydrate transport and 


Salmenkallio, M. and 




metabolism 


Sopanen, T. (1989). Plant 




- Glucose transport 


Physiol. 89, 1285-1291. 




protein 


Baumgartner, B. and 




(CLONEJD 33727) 


Chrispeels, M. J. (1976). 




- Putative sugar 


Plant Physiol. 58, 1-6. 




transporter 


Elpidina, E. N. et al. (1991). 




(CLONE JD 3250) 


Planta 185, 46-52. 




- Starch biosynthesis 


Ericson, M. C. and 






Chrispeels, M. J. (1973). 




Coenzyme metabolism 


Plant Physiol. 52, 98-104. 




- Tyrosine 


Kern, R. and Chrispeels, M. 




aminotransferase 


J. 1978) Plant Physiol. 62, 






815-819. 




(ROOTY/SUPERROOT1) 


Dilworth, M. F. and Dure, 




(CLONEJD 14570) 


L. IE. (1978). Plant Physiol. 




- Formate dehydrogenase 


61, 698-702. 




(CLONEJD 7530) 


Chrispeels, M. J. and Jones, 






R. L. (1980/81). Isr. J. Bot. 




Lipid metabolism 


29, 222-245. 




- Branched chain oc- 


Gould, S. E. B., and Rees, 




ketoacid 


D. A. (1964). J. Sci. Food 




dehydrogenase E2 


Agric. 16, 702-709. 




subunit 






(CLONEJD 25116) 






- Acyl carrier protein- 1 






(CLONEJD 14291) 






- Lipid metabolic enzymes 
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Secretion 

- Sensor protein RcsC- 
like 

(CLONEJD 16461) 

- Signal recognition 
particle 

RP54 (CLONEJD 
22158) 




Modulate floral organ 
number 


Transcriptional control 
ANT (AP2-domain) DNA 
binding protein 
SUP (Zinc ringer) 


Elliot etal. (1996). Plant 

Cell 8, 155-168. 

Sakai et al. (2000). Plant 

Cell 12, 1607-1618. 

Jacobsen and Meyerowitz 

(1997). Science 277, 1100- 

1103. 


Floral organ size 


Transcriptional control 
ANT (AP2-domain) DNA 
binding protein 


Mizukami et al. (2000). 
PNAS 97, 942-947. 
Krizek(1999). 
Developmental Genetics 25, 
224-236. 


Female organ 
number/Floral meristem 
size 


Membrane receptor kinase 
signal transduction 
CLV1 (LRR domain and 
kinase domain) receptor 
CLV2 (LRR domain) 
receptor 

CLV3 (Receptor ligand) 


Clark and Meyerowitz 
(1997). Cell 89, 575-585 
Jeong etal. (1999). Plant 
Cell 11, 1925-1934. 
Fletcher et al. (1999). 
Science 283, 1911-1914. 


Female reproduction 


DNA binding protein 
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AG (MADS domain) DNA 
binding protein 


Yanofskyetal.(1990). 
Nature 346, 35-39. 


Female reproduction 


Signal transduction 
CTR1 (Raf kinase) 


Kieberetal. (1993). Cell 
72, 427-441. 


Male organ number 


DNA methylation 
MET1 (DNA 
methyltransferase) 


Jacobsen and Meyerowitz 
(1997). Science 277, 1100- 
1103. 


Seed size control 


DNA binding protein 
AP2 (AP2 domain) 
RAP2 (AP2 domain) 


Jofuku et al. (1994). Plant 
Cell 6, 1211-1225. 
US Patent #6,093,874; 
#5,994,622 


Seed size control 


Polycomb group protein 

complex 

FIE, FIS2, MEA 


Luo et al. (2000). PNAS 97, 
10637-10642. 


Seed size control 


DNA methylation 
MET1 


Scott et al. (2000). 
Development 127, 2493- 
2502. 

Vinkenoog et al. (2000). 
Plant Cell 12, 2271-2282. 
Luo et al. (2000). PNAS 97, 
10637-10642. 


Embryo 

development/Embryo 
viability 


CAAT box binding complex 

LEC1/HAP3 

HAP2, HAP5 


Lotan et al. (1998). Cell 93, 
1195. 

US Patent #6,235,975 


Embryo development/Seed 
dormancy 


DNA binding proteins 
ABU (AP2 domain) 


Finkelstein et al. (1998). 
Plant Cell 10, 1043-1054. 
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FUS3 (B3 domain) 


Luerssen et al. (1998). Plant 






VP1 (B3 domain) 


J. 15,755-764. 




Embryo development 


Signal transduction 








ABI1, ABI2 


Leung et al. (1994). Science 






[Serine/threonine protein 


264, 1448-1452. 






phosphatase 2C (PP2C)] 


Leung etal. (1997). Plant 








Cell 9, 759-771. 




Endosperm development 


Chromatin level control of 








gene activity 








Polycomb complex; FIE, 


Ohad et al. (1996). PNAS 






MEA, FIS2 


93,5319-5324. 


if 1 ! 






US Patent #6,229,064 


i 






Kiyosueetal. (1999). 








PNAS 96, 4186-4191. 


n i 






Grossniklaus et al. (1998). 


yi 






Science 280, 446-450. 


2 






Chaudhury et al. (1997) 


CD 

rl i 






PNAS 94, 4223-4228. 


= w 

f 


Integument 


DNA binding 






development/Seed coat 


AP2, ANT (AP2 domain) 


Jofukuetal. (1994). Plant 




development 




Cell 6, 1211-1225. 








Klucher et al. Plant Cell 8, 








137-153. 






BEL1 (Homeodomain) 


Reiser et al. (1995). Cell 83, 








735-742. 




Anthocyanin production 


Secondary transporter 








TT12 (MATE; multidrug and 


Debeaujon et al. (2001). 






toxic compound extrusion) 


Plant Cell 13, 853-872. 




Anthocyanin production 


DNA binding protein 








TT8 (Basic helix-loop-helix 


Nesi et al. (2000). Plant Cell 






domain) 


12, 1863-1878. 



Page 157 of 772 



Attorney Diet. 275U-1483F 
Client Dkt. 80090.005 



Fruit development 


Chromatin level control of 


Ohad et al. (1996). PNAS 




gene activity 


93, 5319-5324. 




Polycomb complex; FIE, 


Kiyosueetal.(1999). 




MEA, FIS2 


PNAS 96, 4186-4191. 






Grossniklaus et al. (1998). 






Science 280, 446-450. 






Chaudhury et al. (1997) 






PNAS 94, 4223-4228. 


Fruit size control 


Signal transduction 






FW2.2 (c-Ras P21) 


Frary et al. (2000). Science 






289, 85-88. 


Fruit development/Pod 


Transcriptional control 




shattering 


SHP1, SHP2, FUL (MADS 


Liljegren et al. (2000). 




domain) DNA binding 


Nature 404, 766-770. 




proteins 


Ferrandiz et al. (2000). 






Science 289, 436-438.. 


Transcription and 


Transcription 


Delseny, M. et al. (1977). 


Posttranscription 


- SRF-domain AGL11 


Planta 135, 125-128. 




(CLONEJD 32791) 


Lalonde, L. and Bewley, J. 




- AP2-domain containing 


D. (1986). J. Exp. Bot. 37, 




protein (CLONEJD 


754-764. 




332) 


Walling, L. et.al. (1986). 




- Myb-DNA binding 


PNAS 83, 2123-2125. 




protein 


Okamuro, J. K. and 




(CLONEJD 94597) 


Goldberg, R. B. (1989). In 






"Biochemistry of Plants, 




Transcription factors 


Vol 15." Academic Press, 






Inc. 




Signal transduction 


Wong, J. etal. (1995). 




mechanisms > 


Genes Dev. 9, 2696-2711. 




- Protein-kinases 


Dimitrov et al. (1994). J. 




- Phosphatases 


Cell Biol. 126, 591-601. 




- meiosis proteins 


Landsberger, N. and 
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- Chromatin remodeling 


Wolffe, A. P. (1997). 




proteins 


EMBOJ. 16, 4361-4373. 




- Chaperones 


Bogdanove, A. J. and 




- Chalcone synthase 


Martin, G. G. (2000). PNAS 




- Putative Ser/Thr protein 


97, 8836-8840. 




kinase (CLONEJD 


Zhu, H. et al. Science July 




31383) 


26, 2001: 




- ER6-like protein 


10.1 126/science. 1062191 




(implicated in ethylene 


(Reports). 




signal transduction) 






(CLONEJD 7474) 






Translation, ribosomal 






structure and biogenesis 






- Ribosomal proTein 






S15A 






(CLONEJD 17466) 






- Translation initiation 






factor 






(CLONEJD 103464) 






Posttranslational 






modification, protein 






turnover, chaperones 






- DnaJ-domain containing 






protein (CLONE ID 






4150) 






- Cyclophilin-like protein 






(CLONEJD 35643) 




Cell division and Repair 


Cell division and 


Rogan, P. G. and Simon, E. 




chromosome partitioning 


W. (1975). New Phytol. 74, 




- Protein of unknown 


273-275. 
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function 

with tropomyosin-, 
myosin 

tail- and filament- 
domains 

(CLONEJD 15546) 

- Actin-1 
(CLONEJD 25785) 

DNA replication, 
recombination and repair 

- Proliferating cell 
nuclear 

antigen- 1 (axillary 
protein, 

DNA polymerase I 
delta) 

(CLONEJD 28554) 

- AAA-type ATPase, 
cdc48 

(CLONEJD 100292) 



Morahashi, Y. and Bewley, 
J. D. (1980). Plant Physiol 
66,70-73. 

Morahashi, Y. et al. (1981). 
Plant Physiol. 68,318-323. 
Morahashi, Y. (1986). 
Physiol. Plant. 66, 653-658. 
Zlatanova, J. et al. (1987). 
Plant Mol. Biol. 10, 139- 
144. 

Zlatanova, J. and Ivanov, P. 
(1988). Plant Sci. 58,71-76. 



Cell envelope biogenesis, 
outer membrane 

- dTDP-D-glucose 4,6- 
dehydratase 
(CLONEJD 28597) 

- Putative cinnamoyl- 
CoA 

reductase 

(CLONEJD 109228) 
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Other biological activities that are modulated by the reproductive genes and gene 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table, for example. 



C. 1 .c. USE OF REPRODUCTION GENES, GENE COMPONENTS AND 

PRODUCTS TO MODULATE TRANSCRIPTION LEVELS 

Reproduction genes are characteristically differentially transcribed in response to cell 
signals such as fluctuating hormone levels or concentrations, whether internal or external to 
an organism or cell. Many reproduction genes belong to networks or cascades of genes under 
the control of regulatory genes. Thus some reporduction genes are useful to modulate the 
expression of other genes. Examples of transcription profiles of reproduction genes are 
described in the Table below with associated biological activities. "Up-regulated" profiles are 
those where the mRNA transcript levels are higher in flowers, flower parts or siliques as 
compared to the plant as a whole. "Down-regulated" profiles represent higher transcript 
levels in the whole plant as compared to flowers, flower parts or siliques alone. 









EXAMPLES OF 


TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 


WITH ALTERED 


CONSEQUENCES 


ACTIVITIES OF 




ACTIVITY 


OF ALTERING 


GENES WITH 






GENE EXPRESSION 


ALTERED 








EXPRESSION 


Up Regulated 


Genes that control 


Flowers form from 


Transcription Factors 


Transcripts 


flower differentiation, 


flower meristem 


Signal transduction 


Flower 


number and size 




Membrane Structure 


Reproduction 






Protein kinases 


Genes 


Genes that promote 


Floral organs mature 


Phosphatases 




petal, stamen and 




Meiosis proteins 




carpel formation 




Chromatin 








remodeling proteins 








Chaperones 
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EXAMPLES OF 


TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 


WITH ALTERED 


CONSEQUENCES 


ACTIVITIES OF 




ACTIVITY 


OF ALTERING 


GENES WITH 






GENE EXPRESSION 


ALTERED 








EXPRESSION 




Genes controlling 


Flavonoid pathways 


Chalcone synthase 




flower-specific 


induced 






metabolism such as 








petal pigments 












Amino acid transport 




Genes that promote 




and metabolism 




ovule formation 




Storage protein 








synthesis 




Genes that promote 




Lipid metabolic 




fertilization, seed, 




enzymes 




embryo and 




Carbohydrate 




endosperm formation 




transport and 








metabolism 








Starch biosynthesis 


AP2 


Genes activated by 


Many steps and 


Proteins associated 


Reproduction 


AP2 transcription 


pathways induced, 


with: 


Genes 


factors 


developmental and 


Energy production 






metabolic 


and conversion 








Amino acid transport 




Genes that induce 


No petals or stamens 


and metabolism 




petal and stamen 


produced 


Carbohydrate 


• 


formation 




transport and 








metabolism 
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EXAMPLES OF 


TRANSCRIPT. 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 


WITH ALTERED 


CONSEQUENCES 


ACTIVITIES OF 




ACTIVITY 


OF ALTERING 


GENES WITH 






GENE EXPRESSION 


ALTERED 








EXPRESSION 








Lipid metabolism 








Transcription and 








signal transduction 








Poor translational 








modification 








DNA replication 








Chromatin 








remodeling 


Down-Regulated 








Transcripts 








Flower 


Genes that repress 


Flowers form from 


Transcritipion factors 


Reproduction 


flower development 


flower meristem 


Signal transduction 


Genes 






pathways 








Kinases and 








phosphatases 




Genes that induce 


Non-floral organs are 


Chromatin 




stem, leaf and other 


repressed 


remodeling protems 




organ differentiation 








Genes that negatively 


Flower-specific 






regulate flower 


pathways are induced 






specific metabolism 








Genes that negatively 








regulate ovule 








formation, meiosis, 
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EXAMPLES OF 


TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


BIOCHEMICAL 


LEVELS 


WITH ALTERED 


CONSEQUENCES 


ACTIVITIES OF 




ACTIVITY 


OF ALTERING 


GENES WITH 






GENE EXPRESSION 


ALTERED 








EXPRESSION 




fertilization and seed 








development 








Genes activated by 


Many steps and 


Proteins associated 


AP2 Reproduction 


AP2 transcription 


pathways induced, 


with: 


Genes 


factors 


developmental and 


Energy production 






metabolic 


and conversion 








Amino acid transport 




Genes that induce 


No petals or stamens 


and metabolism 




petal and stamen 


produced 


Carbohydrate 




formation 




transport and 








metabolism 








Lipid metabolism 








Transcription and 








signal transduction 








Poor translational 








modification 








DNA replication 








Chromatin 








remodeling 



While polynucleotides and gene products modulating reproduction can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different polynucleotides and/or gene products of the instant invention 
that have similar transcription profiles or similar biological activities, and members of the 
same or similar biochemical pathways. In addition, the combination of a polynucleotide 
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and/or gene product(s) capable of modulating reproduction with a hormone responsive 
polynucleotide, particularly one affected by gibberellic acid and/or auxin, is also useful 
because of the interactions that exist between hormone regulated pathways, and development. 
Here, in addition to polynucleotides having similar transcription profiles and/or biological 
activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in common or overlapping pathways. 

Use Of Promoters And Reproduction Genes 

Promoter of reproduction genes are useful for transcription of desired 
polynucleotides, both plant and non-plant. For example, extra copies of carbohydrate 
transporter genes can be operably linked to a reproduction gene promoter and inserted into a 
plant to increase the "sink" strength of flowers or siliques. Similarly, reproduction gene 
promoters can be used to drive transcription of metabolic enzymes capable of altering the oil, 
starch, protein or fiber of a flower or silique. Alternatively, reproduction gene promoters can 
direct expression of non-plant genes that can, for instance confer insect resistance specifically 
to a flower. 
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I.C.2. OVULE GENES, GENE COMPONENTS AND PRODUCTS 



The ovule is the primary female sexual reproductive organ of flowering plants. It 
contains the egg cell and, after fertilization occurs, contains the developing seed. 
Consequently, the ovule is at times comprised of haploid, diploid and triploid tissue. As such, 
ovule development requires the orchestrated transcription of numerous polynucleotides, some 
of which are ubiquitous, others that are ovule-specific and still others that are expressed only 
in the haploid, diploid or triploid cells of the ovule. 

Although the morphology of the ovule is well known, little is known of these 
polynucleotides and polynucleotide products. Mutants allow identification of genes that 
participate in ovule development. As an example, the pistillata (PI) mutant replaces stamens 
with carpels, thereby increasing the number of ovules present in the flower. Accordingly, 
comparison of transcription levels between the wild-type and PI mutants allows identification 
of ovule-specific developmental polynucleotides. 

Changes in the concentration of ovule-specific polynucleotides during development 
results in the modulation of many polynucleotides and polynucleotide products. Examples of 
such ovule-specific responsive polynucleotides and polynucleotide products are shown in the 
Reference, Sequence, Protein Group, Protein Group Matrix, MA_diff, and MA clust tables. 
These polynucleotides and/or products are responsible for effects on traits such as fruit 
production and seed yield. 

While ovule-specific developmentally responsive polynucleotides and polynucleotide 
products can act alone, combinations of these polynucleotides also affect fruit and seed 
growth and development. Useful combinations include different ovule-specific 
developmentally responsive polynucleotides and/or polynucleotide products that have similar 
transcription profiles or similar biological activities, and members of the same or similar 
biochemical pathways. In addition, the combination of an ovule-specific developmentally 
responsive polynucleotide and/or polynucleotide product with an environmentally responsive 
polynucleotide is also useful because of the interactions that exist between development, 
hormone regulated pathways, stress pathways and nutritional pathways. Here, in addition to 
polynucleotides having similar transcription profiles and/or biological activities, useful 
combinations include polynucleotides that may have different transcription profiles but which 
participate in a common pathway. The MA diff Table(s) reports the transcript levels of the 
experiment (see EXPT ID: 108595). For transcripts that had higher levels in the samples than 
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the control, a is shown. A "-" is shown for when transcript levels were reduced in root 
tips as compared to the control. For more experimental detail see the Example section below. 
Ovule genes are those sequences that showed differential expression as compared to controls, 
namely those sequences identified in the MA diff tables with a "+" or "-" indication 

Ovule Genes Identified By Cluster Analyses Of Differential Expression 

Ovule Genes Identified By Correlation To Genes That Are Differentially Expressed 
As described above, the transcription profiles of genes that act together are well 

correlated. Applicants not only have identified the genes that are differentially expressed in 

the microarray experiments, but also have identified the genes that act in concert with them. 

The MAclust table indicates groups of genes that have well correlated transcription profiles 

and therefore participate in the same pathway or network. 

A pathway or network of Ovule genes is any group in the MA clust that comprises a 

cDNA ID that also appears in Expt ID 108595 of the MA_diff table(s). 

Ovule Genes Identified By Correlation To Genes That Cause Physiological 
Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Ovule genes. A group in the MA_clust is 
considered a Ovule pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

Ovule Genes Identified By Amino Acid Sequence Similarity 

Ovule genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis Ovule genes. Groups of 
Ovule genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Ovule pathway or 
network is a group of proteins that also exhibits Ovule functions/utilities. 

Such ovule-specific developmentally responsive polynucleotides and polynucleotide 
products can function to either increase or dampen the above phenotypes or activities either 
in response to transcript changes during ovule development or in the absence of ovule- 



Page 167 of 772 




Attorney im. 27MJ- 14831' 
Client Dkt 80090.005 



specific polynucleotide fluctuations. More specifically, ovule-specific developmental^ 
responsive polynucleotides and polynucleotide products are useful to or modulate one or 
more of the following phenotypes: 
- Egg Cell 



Maturation 

Metabolism 
Polar nuclei 
Fusion 

Central Cell 

Maturation 

Metabolism 

Synergids 
Maturation 

Programmed cell death 

Nucellus 

Maturation 

Integuments 

Maturation 

Funiculus 

Extension 

Cuticle 
Maturation 
Tensile properties 



for development of parthenogenic embryos 



for development of parthenogenic endosperm 



For alteration of endosperm metabolism 



for increased seed 



for increased seed size 



Ovule 

Modulation of ovule senescence 
Shaping 

for increased seed number 
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To produce the desired phenotype(s) above, one or more of the ovule-specific 
developmentally responsive polynucleotides and polynucleotide products can be 
tested by screening for the desired trait. Specifically, the polynucleotide, mRNA 
levels, or protein levels can be altered in a plant utilizing the procedures described 
herein and the phenotypes can be assayed. As an example, a plant can be transformed 
according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) and 
visually inspected for the desired phenotype or metabolically and/or functionally 
assayed according to Weigel et al. (2000, Plant Physiol 122: 1003-14) and Winkler et 
al. (1998, Plant Physiol 118: 743-50). 

Alternatively, the activities of one or more of the ovule-specific 
developmentally responsive polynucleotides and polynucleotide products can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



GENERAL 
CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


ASSAY 


Cell Growth and 


-Programmed Cell Death 


Pennell and Lamb 


Differentiation 




(1997) Plant Cell 9, 






1157-1168 




- DNA Methylation and 


Adams et al. (2000) 




Imprinting 


Development 127: 






2493-502 


Organ Growth and 


-Ovule Growth and 


De Martinis and 


Development 


Development 


Mariani (1999) 




-Ethylene Response 


Plant Cell 11: 




-Megagametophyte 


1061-72 




Development 


Christensen et al. 






(1997) Sexual Plant 






Reproduc 10: 49-64 




-Seed Growth and 






Development 


Scott etal. (1998) 
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GENERAL 


BIOCHEMICAL OR 


ASSAY 


CATEGORY 


METABOLIC ACTIVITIES 






AND/OR PATHWAYS 








Development 125: 




-Fertilization 


3329-41 




Independent 


Ohad et al. (1996) 




Seed Development 


PNAS USA 93: 






5319-24 






Chaudhury et al. 






(1997) PNAS USA 






94: 4223-28 


Signal Transduction 


-Ethylene Metabolism 


DeMartinis and 






Mariani (1999) 






Plant Cell 11: 




- Protein Remodeling 


1061-1072 




-Sucrose Mobilization 


Winkler et al. 




and 


(1998) Plant 




Partitioning 


Physiol 118: 743- 




-Pollen Tube Adhesion 


750 




-Jasmonic Acid 






Biosynthesis 
- 




Senescence and Cell 


-Apomixis 




Death 






Environmental 


-Wound and Defense 


Epple and 


Responses 


Response Gene 


Bohlmann(1997) 




Expression 


Plant Cell 9: 509- 






20 






He et al.(1998) 






Plant J. 14: 55-63 




-Stress Response 
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Other biological activities that can be modulated by the ovule-specific 
developmentally responsive polynucleotides and polynucleotide products are listed in the 
Reference tables. Assays for detecting such biological activities are described in the Protein 
Domain table section. 

Ovule-specific developmentally responsive polynucleotides are characteristically 
differentially transcribed in response to fluctuating developmental-specific polynucleotide 
levels or concentrations, whether internal or external to a cell. The MA diff Table reports 
the changes in transcript levels of various ovule-specific developmentally responsive 
polynucleotides in ovules. 



developmentally responsive polynucleotides. Profiles of these different ovule-specific 
developmentally responsive polynucleotides are shown in the Table below with examples of 
associated biological activities. 



These data can be used to identify a number of types of ovule-specific 
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TRANSCRIPTS 


TYPES OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


AFFECTED BY 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Ethylene Signals 


Responders to 


- Ethylene Perception 


-Transcription 




Ethylene 


-Ethylene Uptake 


Factors 






- Modulation of Ethylene 


-Transporters 






Response Transduction 








Pathways 








- Specific Gene 








Transcription Initiation 




Protein 








Remodeling 




-Repression of Pathways 


-Inhibit Transport of 






to Optimize Abscissic 


Abscissic acid 






acid Response Pathways 


-Degradation 


Lower at 1 hours 


High Abscissic acid 




-Abscissic acid 


than 6 hours 


Responders 




Metabolic Pathways 




Repressor of Abscissic 


Negative Regulation of 






acid Deprivation 


Abscissic acid Pathways 






Pathways 







Use of Promoters of Ovule Genes 

Promoters of Ovule genes are useful for transcription of any desired polynucleotide or 
5 plant or non-plant origin. Further, any desired sequence can be transcribed in a similar 
temporal, tissue, or environmentally specific patterns as the Ovule genes where the desired 
sequence is operably linked to a promoter of a Ovule gene. The protein product of such a 
polynucleotide is usually synthesized in the same cells, in response to the same stimuli as the 
protein product of the gene from which the promoter was derived. Such promoter are also 
10 useful to produce antisense mRNAs to down-regulate the product of proteins, or to produce 
sense mRNAs to down-regulate mRNAs via sense suppression. 
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I.C.3. SEED AND FRUIT DEVELOPMENT GENES, GENE 



COMPONENTS AND PRODUCTS 



The ovule is the primary female sexual reproductive organ of flowering plants. At 
maturity it contains the egg cell and one large central cell containing two polar nuclei encased 
by two integuments that, after fertilization, develops into the embryo, endosperm, and seed 
coat of the mature seed, respectively. As the ovule develops into the seed, the ovary matures 
into the fruit or silique. As such, seed and fruit development requires the orchestrated 
transcription of numerous polynucleotides, some of which are ubiquitous, others that are 
embryo-specific and still others that are expressed only in the endosperm, seed coat, or fruit. 
Such genes are termedfruit development responsive genes. 

Changes in the concentration of fruit-development responsive polynucleotides during 
development results in the modulation of many polynucleotides and polynucleotide products. 
Examples of such fruit development responsive polynucleotides and polynucleotide products 
relative to leaves and floral stem are shown in the Reference, Sequence, Protein Group, 
Protein Group Matrix, MAdiff, MA clust, Knock-in and Knock-out tables. The 
polynucleotides were discovered by isolating fruits at developmental stages from Arabidopsis 
wild-type ecotype "Wassilewskija", and measuring the mRNAs expressed in them relative to 
those in a leaf and floral stem sample. These polynucleotides and/or products are responsible 
for effects on traits such as seed size, seed yield, seed composition, seed dormancy, fruit 
ripening, fruit production, and pod shattering. 

While fruit development responsive polynucleotides and polynucleotide products can 
act alone, combinations of these polynucleotides also affect fruit and seed growth and 
development. Useful combinations include different polynucleotides and/or polynucleotide 
products that have similar transcription profiles or similar biological activities, and members 
of the same or functionally similar biochemical pathways. In particular, modulation of 
transcription factors and/or signal transduction pathways are likely to be useful for 
manipulating whole pathways and hence phenotypes. In addition, the combination of ovule- 
developmentally responsive polynucleotides and/or polynucleotide products with 
environmentally responsive polynucleotides is also useful because of the interactions that 
exist between development, hormone regulated pathways, stress and pathogen induced 
pathways and nutritional pathways. Here, useful combinations include polynucleotides that 
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may have different transcription profiles, and participate in common or overlapping 
pathways but combine to produce a specific, phenotypic change. 

Such fruit development responsive polynucleotides and polynucleotide products can 
function to either increase or dampen the above phenotypes or activities either in response to 
transcript changes in fruit development or in the absence of fruit development polynucleotide 
fluctuations. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
108436, 108437, 108438). For transcripts that had higher levels in the samples than the 
control, a is shown. A is shown for when transcript levels were reduced in root tips 
as compared to the control. For more experimental detail see the Example section below. 

Fruit genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Fruit Genes Identified By Cluster Analyses Of Differential Expression 

Fruit Genes Identified By Correlation To Genes That Are Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 

correlated. Applicants not only have identified the genes that are differentially expressed in 

the microarray experiments, but also have identified the genes that act in concert with them. 

The MA_clust table indicates groups of genes that have well correlated transcription profiles 

and therefore participate in the same pathway or network. 

A pathway or network of Fruit genes is any group in the MA_clust that comprises a 

cDNA ID that also appears in Expt ID 108436, 108437, 108438 of the MA_diff table(s). 

Fruit Genes Identified By Correlation To Genes That Cause Physiological 
Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Fruit genes. A group in the MA_clust is 
considered a Fruit pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Fruit Genes Identified By Amino Acid Sequence Similarity 
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Fruit genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis Fruit genes. Groups of 
Fruit genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Fruit pathway or 
network is a group of proteins that also exhibits Fruit functions/utilities. 

- Use of fruit development responsive genes to modulate phenotypes 

Manipulation of the polynucleotides in the mature ovule, developing embryo, 
endosperm, seed coat and fruit enables many features of seed and fruit to be improved 
including the following: 

- Female fertility, megasporogenesis, embryo and endosperm development, 



ovule size, endosperm size, embryo size, seed size, seed yield, seed 
protein, seed oil, seed starch, seed cell number, cell size, seed coat 
development, organ size, dormancy and acquisition of desiccation 
tolerance, seed storage and longevity, seed germination, apomixis, 
production of seedless fruit and vegetables and hybrid seed production. 



To improve any of the phenotype(s) above, activities of one or more of the fruit 
development responsive polynucleotides and polynucleotide products can be modulated and 
the plants can be tested by screening for the desired trait. Specifically, the polynucleotide, 
mRNA levels, or protein levels can be altered in a plant utilizing the procedures described 
herein and the phenotypes can be assayed. As an example, a plant can be transformed 
according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) and visually 
inspected for the desired phenotype or metabolically and/or functionally assayed. 

Use of fruit development responsive genes to modulate biochemical activities 

The activities of one or more of the fruit-expressed polynucleotides and polynucleotide 
products can be modulated to change biochemical or metabolic activities and/or pathways 
such as those noted below. Such biological changes can be achieved and measured according 
to citations such as the following: 



1 . Winkler et al. (1998). Plant Physiol. 1 1 8, 743-750 
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2. Weigel et al. (2000). Plant Physiol. 122, 1003-1014 

3. Cosgrove (1997). Plant Cell 9, 1031-1041 

4. Jacobs (1997). Plant Cell 9, 1021-1029 

5. Reismeier et al. (1994). EMBO J. 13, 1-7 
5 6. Carland et al. (1999). Plant Cell 11, 2123-2138 

7. Cheng etal. (1996). Plant Cell 8, 971-983 

8. Weber et al. (1995). Plant Cell 7, 1835-1846 

9. Leyser and Furner (1 992). Development 1 1 6, 397-403 

10. Hayashi et al. (1998). Plant Cell 10, 183-196. 
10 11. Pyke (1999). Plant Cell 1 1, 549-556 

12. Lotan et al.(1998). Cell 93, 1 195-1205 

13. Lending and Larkins (1989). Plant Cell 1, 1011-1023 

14. Hong et al. (1996). Development 122, 2051-2058. 

15. Fernandez et al. (2000). Science 289, 436-438 
15 16. D'Aoust et al. (1999). Plant Cell 11, 2407-2418 

17. Bewley (1997). Plant Cell 9, 1055-1066 

18. Heath et al. (1986). Planta 169, 304-312 

19. Browse et al. (1986). Anal. Biochem. 152, 141-145 

20. D'Aoust et al. (1999). Plant Cell 11, 2407-2418 

c 20 

Jp Other biological activities that can be modulated by the fruit-specific developmental^ 

responsive polynucleotides and polynucleotide products are listed in Reference Tables. 
P Assays for detecting such biological activities are described in the table as well as in the 



Protein Domain tables. 
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Example: 
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Manipulate 
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Seo et al. 
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Weigel et al. 
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(2000). Plant 




Example: 




(1995). Science 




Physiol 122, 




Cell wall- 




270, 1988- 




1003-1014. 




associated 




1992. Tobacco 




Activation 




protein kinase 




MAP kinase: a 




tagging in 




WAK1. 




possible 
mediator in 




Arabidopsis. 




Example: 




wound signal 
















Thionins. 






Epple and 
Bohlmann 








transduction 










pathways. 




(1997). Plant 
















He et al. 
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genes, Wakl-5 
are expressed 
in specific 
organs of 
Arabidopsis. 

Epple and 
Bohlmann 

(1997) . Plant 
Cell 9, 509- 
520. 

Overexpressi 

on of an 

endogenous 

thionin 

enhances 

resistance of 

Arabidopsis 

against 

Fusarium 

oxysporum. 

Ichimura et al. 

(1998) . DNA 
Res. 5,341- 
5348. 
Molecular 
cloning and 
characterizatior 
of three cDNAs 
encoding 
putative 
mitogen- 
activated 



kinase for 
survival 
during the 
pathogen 
response. 
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protein kinase 
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Weigel et al. 




NPKl-like 


ovules, and 


Kovtun et al. 


tolerance to 


(2000). Plant 




protein kinase 


seeds. 


(2000). PNAS 


drought, 


Physiol 122, 




Example: 
DNA binding 
protein genes: 
CBF-like, 
DREB2A, 
RAP2.1. 


Manipulate 
seed 

dormancy. 

Manipulate 

germination 

frequency. 

Manipulate 
seed storage 
and viability. 


USA 97, 
2940-2945. 
Functional 
analysis of 
oxidative 
stress- 
activated 
mitogen- 
activated 
protein kinase 
cascade in 
plants. 


dessication, 
cold, salinity, 
in ovules, 
developing 
seed and seed. 

Test for 
changes in 
seed viability 
upon storage. 

Test for 
changes in 
germination 
frequencies. 


1003-1014. 
Activation 
tagging in 
Arabidopsis. 



Page 194 of 772 




Attorney DKt. 273U-1483F 

Client Dkt. 80090.005 





3. Response 
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(2000). Plant 
Physiol 122, 
1003-1014. 
Activation 
tagging in 
Arabidopsis. 



Browse et al. 
(1986). Anal. 
Biochem. 
152, 141-145. 
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Planta 169, 
304-312. 

Browse et al. 
(1986). Anal. 
Biochem. 
152, 141-145. 

D'Aoust et al. 
(1999). Plant 
Cell 11, 
2407-2418. 



Fruit development responsive polynucleotides are characteristically differentially 
transcribed in response to fluctuating developmental-specific polynucleotide levels or other 
signals, whether internal or external to a cell. MA_diff reports the changes in transcript 
levels of various fruit development responsive polynucleotides in fruits. 

These data can be used to identify a number of types of fruit development responsive 
polynucleotides. Profiles of some of these different fruit development responsive 
polynucleotides are shown in the table below with examples of the kinds of associated 
biological activities. Because development is a continuous process and many cell types are 
being examined together, the expression profiles of genes overlap between stages of 
development in the chart below. 



ENDOSPERM ENDOSPERM 
NUCLEAR CELLULARIZATION 
DIVISION 



ENDOSPERM ABSORPTION BY 
EMBRYO 



I- 



+ 



+ 





MU Mid Torpedo 

Gtobutv Heart 



5 6 

Uptumed-U 
Walking-stick 



7 8 ./ 9 X) 
DAYS AFTER FLOWERING (DAF) 



O 
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Transcript Levels 


Developmental 


Metabolic Pathways 


Examples of 




Process 




Biochemical 








Activity 


(0-5 mm)»(5-10 


Ovule Elongation 


- Hormone Production, 


-Transcription 
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null / I m. vy iiuii f 
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modulating cell 
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for protein 






- Lipoxygenase 


translation 
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Development 
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- Transporters 




Protein Deposition 


- Sucrose Mobilization and 


- Kinases 
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Transcript Levels 


Developmental 


Metabolic Pathways 
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Transcript Levels 


Developmental 
Process 


Metabolic Pathways 


Examples of 
Biochemical 
Activity 






desiccation tolerance 
-Senescence 


enzymes 
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I.D. DEVELOPMENT GENES, GENE COMPONENTS AND PRODUCTS 

LP, 1 . IMBIBITION AND GERMINATION RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS 

Seeds are a vital component of the world's diet. Cereal grains alone, which comprise 
-90% of all cultivated seeds, contribute up to half of the global per capita energy intake. The 
primary organ system for seed production in flowering plants is the ovule. At maturity, the 
ovule consists of a haploid female gametophyte or embryo sac surrounded by several layers 
of maternal tissue including the nucleus and the integuments. The embryo sac typically 
contains seven cells including the egg cell, two synergids, a large central cell containing two 
polar nuclei, and three antipodal cells, that . Pollination results in the fertilization of both egg 
and central cell. The fertilized egg develops into the embryo. The fertilized central cell 
develops into the endosperm. And the integuments mature into the seed coat. As the ovule 
develops into the seed, the ovary matures into the fruit or silique. Late in development, the 
developing seed ends a period of extensive biosynthetic and cellular activity and begins to 
desiccate to complete its development and enter a dormant, metabolically quiescent state. 
Seed dormancy is generally an undesirable characteristic in agricultural crops, where rapid 
germination and growth are required. However, some degree of dormancy is advantageous, 
at least during seed development. This is particularly true for cereal crops because it prevents 
germination of grains while still on the ear of the parent plant (preharvest sprouting), a 
phenomenon that results in major losses to the agricultural industry. Extensive domestication 
and breeding of crop species have ostensibly reduced the level of dormancy mechanisms 
present in the seeds of their wild ancestors, although under some adverse environmental 
conditions, dormancy may reappear. By contrast, weed seeds frequently mature with 
inherent dormancy mechanisms that allow some seeds to persist in the soil for many years 
before completing germination. 

Germination commences with imbibition, the uptake of water by the dry seed, and the 
activation of the quiescent embryo and endosperm. The result is a burst of intense metabolic 
activity. At the cellular level, the genome is transformed from an inactive state to one of 
intense transcriptional activity. Stored lipids, carbohydrates and proteins are catabolized 
fueling seedling growth and development. DNA and organelles are repaired, replicated and 
begin functioning. Cell expansion and cell division are triggered. The shoot and root apical 
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meristem are activated and begin growth and organogenesis. Schematic 4 summarizes some 
of the metabolic and cellular processes that occur during imbibition. Germination is complete 
when a part of the embryo, the radicle, extends to penetrate the structures that surround it. In 
Arabidopsis, seed germination takes place within twenty- four (24) hours after imbibition. As 
such, germination requires the rapid and orchestrated transcription of numerous 
polynucleotides. Germination is followed by expansion of the hypocotyl and opening of the 
cotyledons. Meristem development continues to promote root growth and shoot growth, 
which is followed by early leaf formation. 



Schematic 4. Cellular and Molecular Events in Seed Imbibition and Germination 



24 HRS AFTER IMBIBITION 
1 



Dry Seed 



radicle 




500 ion 




Time 



Table reference: Bewley J 997 
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Genes with activities relevant to imbibition-germination and early seedling growth are 
described in the two sections A and B below. 



D.2.a. IDENTIFICATION OF IMBIBITION AND GERMINATION GENES 
Imbibition and germination includes those events that commence with the uptake of 
water by the quiescent dry seed and terminate with the expansion and elongation of the 
shoots and roots. The germination period exists from imbibition to when part of the embryo, 
usually the radicle, extends to penetrate the seed coat that surrounds it. 

Imbibition and germination genes identified herein are defined as genes, gene 
components and products capable of modulating one or more processes of imbibition and 
germination described above. They are useful to modulate many plant traits from early vigor 
to yield to stress tolerance. Examples of such germination genes and gene products are 
shown in the Reference and Sequence Tables. The functions of many of the genes were 
deduced from comparisons with known proteins and are also given in the REF Tables. 

Imbibition and Germination Genes Identified by Phenotypic Observations 
Imbibition and germination genes are active, potentially active or more active during 
growth and development of a dry seed into a seedling. These genes herein were discovered 
and characterized from a much larger set of genes in experiments designed to find genes that 
cause poor germination. 

In these experiments, imbibition and germination genes were identified by either 1) 
ectopic expression of a cDNA in a plant or (2) mutagenesis of the plant genome. The seeds 
were then imbibed and cultivated under standardized conditions and any phenotypic 
differences in the modified plants compared with the parental "wild-type" seedlings were 
recorded. The genes causing the changes were deduced from the cDNA inserted or gene 
mutated. The phenotypic differences observed were poor germination and aberrant seedlings. 



Imbibition and Germination Genes Identified by Differential Expression 
Germination genes were also identified by measuring the relative levels of mRNA 
products of genes in different stages of germination of a seed versus the plant as a whole. 
Specifically, mRNA was isolated from whole imbibed seeds of Arabidopsis plants 1, 2, 3 or 4 
days after imbibition and compared to mRNA isolated from dry seed-utilizing microarray 
procedures. The MA_diff Table reports the transcript levels of the experiment. For transcript 
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levels that were higher in the imbibed seed than in dry seed a u +" is shown. A "-" is shown 
when the transcript levels in dry seed were greater than those in imbibed seed. For more 
experimental detail, see the examples below: 

Germination associated genes can be identified by comparing expression profiles of 
imbibed gibberellin treated and untreated gal mutant seed. Germination associated genes can 
also be identified by comparing expression profiles in late maturation seed from wild-type 
and mutants that are defective for the establishment of dormancy and can germinate 
precociously (e.g. abal, aba2, abi4 in arabidopsis and vpl, vp5 in maize) or are defective for 
the specification of cotyledon identity and dessication tolerance (e.g. led, lec2, and fus3). 

The MAdiff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
108461, 108462, 108463, 108464, 108528, 108529, 108530, 108531, 108545, 108546, 
108547, 108518, 108529, 108543, 108544). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Imbibed & Germinating Seeds genes are those sequences that showed differential 
expression as compared to controls, namely those sequences identified in the MA_diff tables 
with a "+" or "-" indication 

Imbibed & Germinating Seeds Genes Identified By Cluster Analyses Of Differential 
Expression 

Imbibed & Germinating Seeds Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of Imbibed & Germinating Seeds genes is any group in the 
MA_clust that comprises a cDNA ID that also appears in Expt ID 108461, 108462, 108463, 
108464, 108528, 108529, 108530, 108531, 108545, 108546, 108547, 108518, 108529, 
108543, 108544 of the MA_diff table(s). 
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Imbibed & Germinating Seeds Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Imbibed & Germinating Seeds genes. A group in 
the MAclust is considered a Imbibed & Germinating Seeds pathway or network if the group 
comprises a cDNA ID that also appears in Knock-in or Knock-out tables that causes one or 
more of the phenotypes described in section above. 

Imbibed & Germinating Seeds Genes Identified By Amino Acid Sequence Similarity 
Imbibed & Germinating Seeds genes from other plant species typically encode 
polypeptides that share amino acid similarity to the sequences encoded by com and 
Arabidopsis Imbibed & Germinating Seeds genes. Groups of Imbibed & Germinating Seeds 
genes are identified in the Protein Group table. In this table, any protein group that comprises 
a peptide ID that corresponds to a cDNA ID member of a Imbibed & Germinating Seeds 
pathway or network is a group of proteins that also exhibits Imbibed & Germinating Seeds 
functions/utilities. 

D.2.b. USE OF IMBIBITION AND GERMINATION GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE PHENOTYPES 
Imbibition and germination genes and gene products can be divided into those that act 
during primary events, secondary events, and/or termination. The genes and gene products of 
the instant invention are useful to modulate any one or more of the phenotypes described 
below: 

I. Primary events 

A. Dormancy 

Imbibition and germination genes and gene products of the invention can act to 
modulate different types of dormancy including: 

1 . Primary dormancy - dormancy is established during seed development 
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2. Seed coat-imposed dormancy - dormancy is imposed by blocking 
water uptake, mechanical restraint of embryo, blocking the exit of 
inhibitors 

3. Embryo dormancy - cotyledon mediated inhibition of embryonic axis 
growth 

4. Secondary dormancy - dormancy is induced when dispersed, mature 
seeds are exposed to unfavorable conditions for germination (e.g. 
anoxia, unsuiTable temperature or illumination). 

5 . Hormone-induced 



B. Dormancy-breaking signal perception and transduction 
Germination genes and gene products include those that are able to modulate 
the response to dormancy releasing signals such as: 

1 . Fruit ripening and seed development 

2. Imbibition 

3. Temperature - low and high (range 0-23°) 

4. Light - particularly for coat imposed dormancy 

a. White light 

b. Intermittent illumination 

c. Orange and red region of the spectrum (longer than 700 
or 730 nm) 

d. Phytochrome 

5. Coat softening 

6. Chemicals 

a. Respiratory inhibitors 

b. Sulfhydryl compounds 

c. Oxidants 

d. Nitrogenous compounds 

e. Growth regulators - GA, BA, ethylene 

f Various, ethanol, methylene blue, ethyl ether, fiisicoccin 

7. Oxygen and Carbon dioxide 

8. Stress 
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n. Secondary Events 

During the secondary events of germination, dormancy-maintaining metabolism is 
repressed, dormancy-breaking metabolism is induced and structures surrounding the embryo 
weaken (where present). Germination genes and gene products are useful to modulate 
processes of the secondary events as follows: 



B. Respiration - oxygen consumption 

The genes and genes products of the invention can regulate the 
following pathways which resume during the first respiratory burst of 
germination: 

1. Glycolysis 

2. Pentose phosphate 

3. Citric acid 

4. Tricarboxylic acid cycle 

C. Mitochondrial development 



Tissues of the mature dry seed contain mitochondria, and although these organelles 
are poorly differentiated as a consequence of the drying, they contain sufficient Kreb's cycle 
enzymes and terminal oxidases to provide adequate amount of ATP to support metabolism 
for several hours after imbibition. During germination of embryos, there appears to be two 
distinct patterns of mitochondrial development. In starch-storing seeds, repair and activation 
of preexisting organelles predominate, whereas oil-storing seeds typically produce new 
mitochondria. Germination genes and gene products of the invention are useful to modulate 
the repair, activation and biogenesis pathways of mitochondria. Specific examples are as 
follows: 



A. 



Water uptake 

1. Cell expansion 

2. Change in osmotic state 
1. Ion exchange 



4. 



2. 



3. 



1. 



Membrane formation and repair 
DNA repair and synthesis 
Protein synthesis 

Coordinated regulation of mitochondrial and nuclear genomes 
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D. 



Metabolism 



In addition to respiration and organelle activity, enzyme activity, DNA repair, RNA 
synthesis and protein synthesis are fundamental cellular activities intimately involved in the 
completion of germination and the preparation for subsequent growth. Imbibition and 
germination genes and gene products of the invention can participate in or modulate these 
activities. Examples are: 



HI. Termination 

The last stage of seed germination is characterized by the emergence of the radicle or 
root apex through the seed coat. Typically, the cell walls loosen and the radicle extends from 
the embryo during late germination. Germination genes and gene products are useful to 
modulate the mobilization of stored reserves, DNA synthesis and cell division that are typical 
of this stage of germination. 

To regulate any of the phenotype(s) above, activities of one or more of the late 
germination genes or gene products can be modulated and tested by screening for the desired 
trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing 
the procedures described herein and the phenotypes can be assayed. As an example, a plant 
can be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259- 
266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and 
visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Dolan et al. (1993, Development 119: 71-84), Dolan et al. (1997, Development 
124: 1789-98), Crawford and Glass (1998, Trends Plant Science 3: 389-95), Wang et al. 
(1998, PNAS USA 95: 15134-39), Gaxiola et al. (1998, PNAS USA 95: 4046-50), Apse et al. 
(1999, Science 285: 1256-58), Fisher and Long (1992, Nature 357: 655-60), Schneider et al. 
(1998, Genes Devel 12: 2013-21) and Hirsch (1999, Curr Opin Plant Biol. 2: 320-326). 



4. 



2. 



3. 



1. 



ABA response(for more detail see section on ABA-responsive 
genes) 

GA response(for more detail see section on GA-responsive 
genes) 

ATP synthesis and Adenylate Energy Charge during 
germination 

The synthesis and utilization of reducing power: pyridine 
nucleotides (NADH and NADPH) 
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D.2.C. USE OF IMBIBITION AND GERMINATION GENES, GENE 
COMPONENTS AND PRODUCT TO MODULATE BIOCHEMICAL ACTIVITIES 
The roles of the biochemical changes associated with imbibition and germination can 
5 be appreciated from a summary of the processes occurring. 

Physiology 

Water plays an important role throughout the plant life cycle. The most dramatic 
10 example of this is in seed germination. Although germination is triggered by water, the 
germination response is also positively regulated by the plant growth regulators the 
gibberellins and negatively affected by the growth regulator abscisic acid. Genes that are 
activated by water and genes that are activated by gibberellins can be identified through 
O expression profiling experiments using arabidopsis mutants defective for gibberellin 
ij=5 biosynthesis or perception (gal, gai), abscisic acid biosynthesis or perception (abal, abi3, and 
% : abi4) in the presence or absence of exogenous gibberellins. These genes can be used to 
fu promote seedling growth and development and other phases of plant development. 

hs '■ 
■f T1 

JL. Transcriptional Control of Gene Activity 

p0 At the end of seed development, dessication and dormancy have imposed a global 

Pi 

state of repression on gene activity throughout the seed. Reactivation of the genome requires 
water and gibberellins. One function of the genes that are activated early by imbibition is the 
rapid and dramatic reversal of gene repression. For example, expression-profiling 
experiments revealed that several thousand genes are hyperactivated in arabidopsis upon 
25 imbibition. These include genes involved in metabolic pathways, genes that promote cell 
growth and division, and transcriptional control genes. Thus one class of genes expressed 
early in imbibition includes those that promote high levels of gene expression. Other early 
genes are responsible for regulating specific metabolic, cell, and developmental processes. 
The strategy for distinguishing these functions was outlined in the Introduction. 

30 

Mobilization of Storage Reserves 

In contrast to the synthesis and accumulation of reserves during seed development an 
important function of genes expressed during imbibition and germination is the control of the 
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mobilization and catabolism of seed storage reserves in the endosperm (in grasses and 
cereals) and the embryo. The mobilization of seed storage reserves is triggered by imbibition 
and may occur over several days. There are three classes of high molecular weight seed 
storage reserves: carbohydrates, triacylglycerols, and storage proteins. Upon imbibition seed 
storage reserves are converted into forms that can be transported and metabolized. Genes 
encoding enzymes for storage reserve catabolism are expressed shortly after imbibition. 
Starch for example is converted to sucrose. Triacylglycerols are converted into acetyl-CoA. 
Storage proteins are converted into amino acids or deaminated to provide carbon skeletons 
for oxidation. 

Carbohydrate Catabolism 

Starch is the most common storage carbohydrate in seeds. The primary components of 
starch are amylose and amylopectin. 

Mobilization 

There are two pathways for starch catabolism - hydrolytic and phosphorolytic. The 
product of these pathways is the monosaccharide glucose. Examples of the enzymes 
responsible for hydrolytic catabolism of starch are: amylase, glucosidase, amylase, 
dextrinase, isoamylase. The enzyme responsible for phosphorolytic activity is starch 
phosphorylase. 

Transport 

The mobilization of starch involves the synthesis of sucrose from glucose, which can 
then be transported to sites for growth in the root and shoot. In some seeds, maltose may be a 
major form of transported carbohydrate. The production of sucrose-6-P from glucose 
involves the following enzymes: UDP-glucose pyrophosphorylase, sucrose-6-P synthetase, 
and sucrose phosphatase. 

Sucrose Catabolism 

In target tissues sucrose is hydrolyzed by fructofiiransidase (invertase) and/or sucrose 
synthetase. The synthesis of glucose from glucose- 1-P involves sucrose synthetase. 
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Cell Biology 

The lumen of the endoplasmic reticulum (ER) is target for other hydrolase activities 
including mannosidase, glucosaminidase, acid phosphatase, phosphodiesterase, and 
phospholipase D. 

TRIACYLGLYCEROL (TAG) CATABOLISM 

Triacylglycerols are the major storage lipids of seeds. The products of TAG 
catabolism in imbibed and germinating seed are glycerol and free fatty acids. Most of the 
glycerol is converted to sucrose for export. Free fatty acids are catabolized through oxidation 
through the glyoxylate cycle and gluconeogenesis. 

Mobilization 

Hydrolysis of triacylglycerols is by lipases yielding glycerol and free fatty acids. Free 
fatty acids are oxidized to acetyl-CoA and propionyl-CoA via oxidation requiring ATP and 
coenzyme A. Catabolism of unsaturated fatty acids also requires cis, trans-isomerases, 
epimerases, and hydratases. Acetyl-CoA is oxidized through the citric acid cycle to C02 and 
H20. More importantly, acetyl-CoA can be utilized via the glyoxylate cycle and 
gluconeogenesis for glucose synthesis. Free fatty acids are also broken down via oxidation. 
Glycerol is converted via phosphorylation and oxidation to DHAP and G3P, which are used 
to synthesize glucose or oxidized via the citric acid cycle. Examples of other induced 
enzymes include isocitrate lyase and malate synthetase 

Transport 

Most of the glycerol, acetyl-CoA, and propionyl-CoA are converted to sucrose for 
transport. This requires the enzymes glycerol kinase and glycerol phosphate oxidoreductase. 

Cell Biology 

Glyoxysome biogenesis is required to support fatty acid catabolism and 
gluconeogenesis. Upon exposure to light there is a loss of glyoxysomes due to their conversion 
to peroxisomes. 

STORAGE PROTEIN CATABOLISM 
Mobilization 
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The hydrolysis of storage proteins to amino acids is performed by a diverse group of 
proteinases and peptidases. The peptidases include endopeptidases, aminopeptidases, and 
carboxypeptidases. They include the A and B class proteinases. The liberated amino acids are 
available for protein synthesis, for deamination and reutilization of ammonia via glutamine 
and asparagine synthesis, and to provide carbon skeletons for respiration. Several enzymes 
including, deaminase, asparagine synthetase, glutamine synthetase and glutamate 
dehydrogenase are important players in the mobilization and utilization of stored nitrogen in 
imbibed seed. 

Transport 

The major transported form of amino acid in germinated seeds is asparagine. In some 
species glutamine and/or homoserine are the major form of transported amino acid. 
Aspartate, glutamate, alanine, glycine, and serine can be converted to sucrose and transported 
as sucrose. Other amino acids are transported unchanged. 

Cell Biology 

Proteinases are sequestered in lumen of endoplasmic reticulum (ER) which then fuses 
with protein bodies. 

While catabolism is high in the storage tissues of imbibed seed the products of 
catabolism are transported to sites of growth including the shoot and root apices fueling 
respiration, biosynthesis, cell division and differentiation. 

DEVELOPMENT 

Imbibition triggers several key processes for seedling development. One is the 
activation of the shoot and root apical meristems. The shoot apical meristem is responsible 
for two primary growth activities. One is the production of the protoderm, procambium and 
ground meristem. The protoderm gives rise to the epidermal system of the plant, the 
procambium to the primary vascular tissues, and the ground meristem to the ground tissues 
including the cortex and pith. The second is the production of leaf primordia, which arise on 
the flanks of the apex. Thus, activation of the shoot apical meristem results in shoot growth 
and organogenesis. 

The root apical meristem, by contrast is responsible for vegetative root development. 
The first primary growth activity of the root apical meristem is the production of the 
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protoderm, procambium and ground meristem. The second primary growth activity is the 

production of the cells that give rise to the root cap. 

Genes that govern shoot apical meristem activation and development can be identified 

in arabidopsis by gene profiling experiments comparing gene expression in wild-type 
5 imbibed seed and partial loss-of-function stm (shootmeristemless ) mutants (see SAM). 

Genes governing root meristem activity can be identified by gene profiling experiments 

comparing gene expression in wild-type imbibed seed and rml (rootmeristemless) mutants. 
Genes identified in this way are useful to promote or retard meristem growth, modify 

and strengthen shoot and root development, promote leaf development as described below. 
10 Changes in the concentration of imbibition-germination activated polynucleotides 

result in the modulation of many other polynucleotides and polynucleotide products. 

Examples of such activated responsive polynucleotides and polynucleotide products relative 

to leaves and floral stem and to fruits at different development stages are shown in the 
O Reference and Sequence Tables. These polynucleotides and/or products are responsible fore 
Ji;5 effects on traits such as seedling growth, seedling viability, and seedling vigor. The 
5"! polynucleotides were discovered by isolating seeds from Arabidopsis wild-type ecotype 

fy "Wassilewskija" imbibed for 24 hours, and measuring the mRNAs expressed in them relative 

Ln 

gi to those in a leaf and floral stem sample and to those in fruits at different developmental 
stages. 

CiO While imbibition-germination activated polynucleotides and polynucleotide products 

ry 

£ can act alone, combinations of these polynucleotides also affect germination. Useful 

combinations include different polynucleotides and/or polynucleotide products that have 
similar transcription profiles or similar biological activities, and members of the same or 
functionally similar biochemical pathways. In addition, the combination of imbibition 

25 germination activated polynucleotides and/or polynucleotide products with environmentally 
responsive polynucleotides is also useful because of the interactions that exist between 
development, hormone-regulated pathways, stress and pathogen induced pathways and 
nutritional pathways. Here, useful combinations include polynucleotides that may have 
different transcription profiles, and participate in common or overlapping pathways but 

30 combine to produce a specific, phenotypic change. 

Such imbibition and germination activated polynucleotides and polynucleotide 
products can function to either increase or dampen the above phenotypes or activities either 
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in response to transcript changes in fruit development or in the absence of fruit-specific 
polynucleotide fluctuations. 





PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 
ALTERED 


CITATIONS INCLUDING 
ASSAYS 




Growth , Differentiation 


Farnesylation Mediated Seed 


Pei et al (1998) Science 282: 




and Development 


Dormancy 


287-290; Cutler et al. (1996) 
Science 273: 1239 




Metabolic activity 


Nitrogen metabolism 


Goupil et al (1998) J Exptl 
Botany 49:1855-62 




Metabolic activity 


-H+ export and membrane 
hyperpolarization 


Cerana et al. (1983) 


•srr 


Metabolic activity 


Chloroplast functioning 


Benkova et al (1999) Plant 


s.5 1 






Physil 121:245-252 


t^i 


Growth, Differentiation 


Regulation of Morphogenesis 


Riou-Khamlichi et al. (1999) 


z tr 

W 


and development 




Science 283: 1541-44 


z : : 
*c= : 

3 


Metabolic activity 


Cell Death 


Lohman et al. (1994) Physiol 


•ecr 






Plant 92: 322-328 


-=rr 

f? ; 

: ; : 
e tr* 


Growth and development 


Promotion of cell division 


Kakimoto (1996) Science 


*■ 




Shoot formation in absence of 


274: 982-985 






exogenous cytokinin 






Metabolic activity 


Membrane repair 


Heath et al. (1986) Planta 169: 
304-12 

Browse et al. (1986) Anal 
Biochem 152: 141-5 
D'Aoust et al (1999) Plant 
Cell 11:2407-18 




Metabolism 


Organic molecule export 


Moody et al. (1988) 
Phytochemistry 27: 2857-61 




Metabolic activity 


Nutrient Uptake 


Uozumio et al. (2000) Plant 
Physiol 122: 1249-59 




Metabolic activity 


Ion export 


Uozumi et al. (2000) Plant 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 
ALTERED 


CITATIONS INCLUDING 
ASSAYS 






Physiol 122: 1249-59 
Frachisse et al. (2000) Plant J 
21: 361-71 


Growth, Differentiation 
and development 


Division and/or elongation 


Zhang and Forde (1998) 
Science 279: 407-409. 
Coruzzi et al. U.S. Pat. No. 
5,955,651 


Metabolic activity 


Regulation of Molecular 
chaperones 


Wisniewski et al. (1999) 
Physiolgia Plantarum 105: 
600-608 


Metabolic activity 


Reactivation of Aggregation 
and Protein Folding 


Lee and Vierling (2000) Plant 
Physiol. 122: 189-197 


Metabolic activity 


Maintenance of Native 
Conformation (cytosolic proteins) 


Queitsch et al. (2000) The 
Plant Cell 12: 479-92 


Metabolic activity 


Regulation of Translational 
Efficiency 


Wells et al. (1998) Genes and 
Development 12: 3236-51 


Metabolic activity 


DNA Repair 


Bewley (1997) Plant Cell 9: 
1055-66 


Metabolic activity 


Protein Synthesis using stored 
or newly synthesized mRNAs 


Heath et al. (1986) Planta 169: 
304-12 


Metabolic activity 


Mitochondrial repair and 
synthesis 


MacKenzie and Mcintosh 
(1999) Plant Cell 11: 571-86 


Metabolic activity 


Commencement of respiration 


Debeaujon et al. (2000) Plant 
Physiol 122: 403-4132 




Water Uptake 


Debeaujon et al. (2000) Plant 
Physiol 122: 403-4132 
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Other biological activities that are modulated by the imbibition-activated 
polynucleotides and polynucleotide products are listed in the Reference Tables. Assays for 
detecting such biological activities are described in the Table below as well as in the Domain 
section of the Reference Table. 

D.2.d. USE OF IMBIBITION AND GERMINATION GENES TO MODULATE 
THE TRANSCRIPTION LEVELS OF OTHER GENES 

The expression of many genes is "upregulated" or "downregulated" during imbibition 
and germination because some imbibition and germination genes are integrated into complex 
networks that regulate transcription of many other genes. Some imbibition and germination 
genes are therefore useful for regulating other genes and hence complex phenotypes. 

Imbibition-activated polynucleotides may also be differentially transcribed in 
response to fluctuating developmental-specific polynucleotide levels or concentrations, 
whether internal or external to a cell, at different times during the plant life cycle to promote 
associated biological activities. These activities are, by necessity, a small subset of the genes 
involved in the development process. Furthermore, because development is a continuous 
process with few clear demarcations between stages, the associated metabolic and 
biochemical pathways overlap. Some of the changes in gene transcription are summarized in 
the Table below: 



DEVELOPMENTAL 


PHYSIOLOGICAL/METABO 


EXAMPLES OF 


PROCESS REGULATED 


LIC CONSEQUENCES OF 


BIOCHEMICAL 


BY IMBIBITION- 


MODIFYING GENE 


REGULATORY 


GERMINATION GENES 


PRODUCT LEVELS 


ACTIVITIES 






ASSOCIATED WITH 






IMBIBITION AND 






GERMINATION 


Tissue Specialization 


- Lipid Catabolism 


- Transcription Factors 




- Lipoxygenase 


- Transporters 


- Cotyledon Expansion 


Localization 


- Kinases 


- Endosperm (???) 


- Starch Catabolism 


- Changes in cytoskeletal 


- Activation of the Shoot 


- Seed Protein Catabolism 


protein activity 


Apical Meristem 


- Growth Regulator Production, 


modulating cell structure 
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- Activation of the Root 


Transport, Perception, 


- Stability of factors for 


Apical Meristem 


Signaling, Response (e.g., 


protein translation 




Gibberellins, Ethylene, 


- Changes in cell 


- Radicle Growth 


Auxin) 


wall/membrane structure 


- Vascular System 




- Chromatin structure 


Development 


- Global Gene Activation 


and/or DNA topology 




- Transcription Initiation 


- Biosynthetic enzymes 




- Sucrose Synthesis and 


- Metabolic enzymes 




Partitioning 






- Sucrose catabolism 






- Sucrose Signaling 






- Cell Wall Biosynthesis 






- Activators of Metabolic 






Pathways 






- Protein Remodeling 




Organelle Differentiation 


- Cell Wall Biosynthesis 


- Transcription Factors 


and Development 


- Membrane Repair and 


- Transporters 




Synthesis 


- Kinases 




- Specific Gene Transcription 


- Chaperones 




Initiation 


- Changes in cytoskeletal 




- Sucrose Mobilization and 


protein activity 




Partitioning 


modulating cell structure 




- Sucrose Signaling 


- Stability of factors for 




- Activators of Metabolic 


protein translation 




Pathways 


- Changes in cell 




- Auxin Perception, 


wall/membrane structure 




Response and Signaling 


- Chromatin structure 




- Protein Remodeling 


and/or DNA topology 




- Lipid Mobilization, 


- Biosynthetic enzymes 




Metabolism and Biosynthesis 


- Metabolic enzymes 




- Protein Transport, 






Metabolism, and Biosynthesis 
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DNA Repair 


- Cell Division 


- Transcription Factors 




- Cell Cycle Control 


- Transporters 




- DNA Replication 


- Kinases 




- Specific Gene Transcription 


- Chaperones 




Initiation 


for protein translation 






- Changes in cell 




- Protein Remodeling 


wall/membrane structure 




- Protein Synthesis 


- Chromatin structure 






and/or DNA topology 




- Repressors of Senescence 


- Biosynthetic enzymes 


Cellular Metabolism 


- Lipid Catabolism 


- Transcription Factors 




- oxidation 


- Transporters 




- Glyoxylate cycle 


- Kinases 




- Citric acid cycle 


- Chaperones 




- Gluconeogenesis 


- Translation Initiation 




- Sucrose Synthesis and 


Factors 




Partitioning 


- Biosynthetic Enzymes 




- Starch Catabolism 


- Metabolic Enzymes 




- Seed Protein Catabolism 






- Asparagine Synthesis and 






Transport 






- Sucrose catabolism 






- Sucrose Signaling 






- Ribosome/polysome 






production and maintenance 






- Housekeeping genes 






- Respiration 






- Photosynthesis 





Changes in the processes of germination are the result of modulation of the activities of 
one or more of these many germination genes and gene products. These genes and/or products 
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are responsible for effects on traits such as fast germination, plant vigor and seed yield, 
especially when plants are growing in the presence of biotic or abiotic stresses or when they are 
growing in barren conditions or soils depleted of certain minerals. 

Germination genes and gene products can act alone or in combination as described in the 
introduction. Of particular interest are combination of these genes and gene products with those 
that modulate stress tolerance and/or metabolism. Stress tolerance and metabolism genes and 
gene products are described in more detail in the sections below. 

Use of Promoters of Imbibition and Germination Genes 

These promoters can be used to control expression of any polynucleotide, plant or non- 
plant, in a plant host. Selected promoters when operably linked to a coding sequence can direct 
synthesis of the protein in specific cell types or to loss of a protein product, for example when 
the coding sequence is in the antisense configuration. They are thus useful in controlling 
changes in imbibition and germination phenotypes or enabling novel proteins to be made in 
germinating seeds. 
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LD.2. EARLY SEEDLING-PHASE SPECIFIC RESPONSIVE GENES, 
GENE COMPONENTS AND PRODUCTS 
One of the more active stages of the plant life cycle is a few days after germination is 
complete, also referred to as the early seedling phase. During this period the plant begins 
development and growth of the first leaves, roots, and other organs not found in the embryo. 
Generally this stage begins when germination ends. The first sign that germination has been 
completed is usually that there is an increase in length and fresh weight of the radicle. 

a) IDENTIFICATION OF EARLY SEEDLING PHASE GENES, 

GENE COMPONENTS AND PRODUCTS 
These genes defined and identified herein are capable of modulating one or more 
processes of development and growth of many plant organs as described below. These genes 
and gene products can regulate a number of plant traits to modulate yield. Examples of such 
early seedling phase genes and gene products are shown in the Reference and Sequence, 
Knock-in, Knock-out and MA-diff Tables. The functions of the protein of some of these 
genes are also given in these Tables. 

Early Seedling Genes Identified by Phenotypic Observations 

Some early seedling genes were discovered and characterized from a much larger set 
of genes by experiments designed to find genes that cause phenotypic changes in germinating 
seeds as the transitioned into seedlings. 

In these experiments, leaf genes were identified by either (1) ectopic expression of a 
cDNA in a plant or (2) mutagenesis of the plant genome. The plants were then cultivated and 
one or more of the following leaf phenotypes, which varied from the parental "wild-type", 
were observed: 

• Abnormal growth 

• Abnormal cotyledons or root growth 

• Reduced growth 

• Abnormal first leaf 

• Abnormal hypocotyl 

• Abnormal pigmentation 
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The genes identified by these phenotypes are given in the Knock-in and Knock-out Tables. 



Early Seedling Phase Genes Identified By Differential Expression 
Such genes are active or potentially active to a greater extent in developing and rapidly 
growing cells, tissues and organs, as exemplified by development and growth of a seedling 3 or 
4 days after planting a seed. These genes herein were also discovered and characterized from a 
much larger set of genes in experiments designed to find genes. Early seedling phase genes 
were identified by measuring the relative levels of mRNA products in a seedling 3 or 4 days 
after planting a seed versus a sterilized seed. Specifically, mRNA was isolated from aerial 
portion of a seedling 3 or 4 days after planting a seed and compared to mRNA isolated from a 
sterilized seed utilizing microarray procedures. The MA_diff Table(s) reports the transcript 
levels of the experiment (see EXPT ID: Sqn (relating to SMD 7133, SMD 7137)). For 
transcripts that had higher levels in the samples than the control, a is shown. A "-" is shown 
for when transcript levels were reduced in root tips as compared to the control. For more 
experimental detail see the Example section below. 

Early Seedling Phase genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 



Early Seedling Phase Genes Identified By Cluster Analyses Of Differential 
Expression 

Early Seedling Phase Genes Identified By Correlation To Genes That 
Are Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Early Seedling Phase genes is any group in the 
MA_clust that comprises a cDNA ID that also appears in Expt ID Sqn (relating to SMD 7133, 
SMD 7137) of the MA_diff table(s). 



Early Seedling Phase Genes Identified By Correlation To Genes That 
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Cause Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Early Seedling Phase genes. A group in the 
MA_clust is considered a Early Seedling Phase pathway or network if the group comprises a 
cDNA ID that also appears in Knock-in or Knock-out tables that causes one or more of the 
phenotypes described in section above. 

Early Seedling Phase Genes Identified By Amino Acid Sequence Similarity 
Early Seedling Phase genes from other plant species typically encode 
polypeptides that share amino acid similarity to the sequences encoded by corn and Arabidopsis 
Early Seedling Phase genes. Groups of Early Seedling Phase genes are identified in the Protein 
Group table. In this table, any protein group that comprises a peptide ID that corresponds to a 
cDNA ID member of a Early Seedling Phase pathway or network is a group of proteins that also 
exhibits Early Seedling Phase functions/utilities. 

Of particular interest are early seedling phase genes that are differentially expressed 3 
or 4 days after planting a seed but not differentially expressed germinating seeds and/or mature 
leaves. 

Examples of phenotypes, biochemical activities, and transcription profiles that can be 
modulated by these genes and gene products are described above and below. 

D.5.b. USE OF EARLY SEEDLING GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 

Rapid, efficient establishment of a seedling is very important in commercial 
agriculture and horticulture. It is also vital that resources are approximately partitioned 
between shoot and root to facilitate adaptive growth. Phototropism and geotropism need to 
be established. All these require post-germination process to be sustained to ensure that 
vigorous seedlings are produced. Early seedling phase genes, gene components and products 
are useful to manipulate these and other processes. 

I. Development 

The early seedling phase genes, gene components and products of the instant 
invention are useful to modulate one or more processes of the stages of leaf morphogenesis 
including: stage 1- organogenesis that gives rise to the leaf primordium; stage 2- delimiting 
basic morphological domains; and stage 3- a coordinated processes of cell division, 
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expansion, and differentiation. Early seedling phase genes include those genes that terminate 
as well as initiate leaf development. Modulating any or all of the processes leads to 
beneficial effects at specific locations . 

A. Gene Sequences Affecting Types of Leaves 

Applicants provide with these genes, gene components and gene products the means 
to modulate one or more of the following types of leaves, and stem: 

5. Cotyledons 

6. Major Leaves 

B. Gene Sequences Affecting Cell properties 

These genes, gene components and gene products are useful to modulate changes in: 



10. 


Cell size 


11. 


Cell division, rate and direction 


12. 


Cell elongation 


13. 


Cell differentiation 


14. 


Xylem and phloem cell numbers 


15. 


Cell wall composition 


16. 


All cell types 



C. Gene Sequences Affecting Leaf Architecture: 

The following properties of a leaf are useful to modulate to change overall leaf 
architecture: 

4. Veination - Improvements in photosynthetic efficiency, stress 
tolerance efficiency of solute and nutrient movement to and from the 
leaf are accomplished by increases or decreases in: 

(c) Vein placement 

(d) Number of cells in the vein 

5. Shape 

(c) Elongated versus rounded 

(d) Symmetry, around either 

- abaxial-adaxial (dorsiventral) axis 

- apical-basal (proximodistal) axis 

- margin-blade-midrib (lateral) axis 
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H. Genes Sequences Influencing Leaf Responses 
Shoot apical naeristem cells differentiate to become leaf primordia that eventually 
develop into leaves. The genes, gene components and gene products of this invention are 
useful to modulate any one or all of these growth and development processes, by affecting 
timing and rate or planes of cell divisions for example, in response to the internal plant 
stimuli and/or programs listed below: 

8. Embryogenesis 

9. Germination 

10. Hormones 

(b) Auxin (for more details see the section on auxin-responsive 
genes) 

11. Phototropism 

12. Coordination of leaf growth and development with that of other organs 

(f) Roots 

(g) Stems 

1 3 . Stress-related programs 

II. Interaction with the Environment 

Successful seedling establishment demands successful interaction with the 
environment in the soil. Early vegetation genes orchestrate and respond to interactions with 
the environment. Thus early seedling phase genes are useful for improving interactions 
between a plant and the environment that include: 

N. Pigment accumulation (see the section on Viability genes for more 
detail) 



O. Oxygen gain/loss control 

P. Carbon dioxide gain/loss control 

Q. Water gain/loss control 

R. Nutrient transport 

S. Light harvesting 

T. Chloroplast biogenesis 

U. Circadian rhythm control 

V. Light/dark adaptation 

W. Defense systems against biotic and abiotic stresses 
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X. Metabolite accumulation 
Y. Secondary metabolite production 

HI. Organizing Tissues for Photosynthesis and Metabolism 
5 Following germination and utilization of seed reserves, plant tissues prepare for 

photosynthesis and seedling metabolism. Leaf meristems, and root meristems participate in 
these changes before cell differentiation. Many of the uses for plants depend on the success 
of leaves as the powerhouses for plant growth, their ability to withstand stresses and their 
chemical composition. Leaves are organs with many different cell types and structures. 
10 Most genes of a plant are active in leaves and therefore leaves have very diverse of pathways 
and physiological processes. Examples of such pathways and processes that are modulated 
by early seedling phase genes, gene components and products include: 





X. 


Photosynthesis 


Pi 


Y. 


Sugar metabolism 


• s ■ 

¥ 


Z. 


Starch synthesis 




AA. 


Starch degradation 


I XT 


BB. 


Nitrate and ammonia metabolism 


w ; 


CC. 


Amino acid biosynthesis, transport 


3 


DD. 


Protein biosynthesis 


go 


EE. 


DNA replication, repair 


J* 


FF. 


Lipid biosynthesis and breakdown 




GG. 


Protein biosynthesis, storage and breakdown 


£ '' 


HH. 


Nucleotide transport and metabolism 




II. 


Cell envelope biogenesis 


25 


JJ. 


Membrane formation 




KK. 


Mitochondrial and chloroplast biogenesis 




LL. 


Transcription and RNA metabolism 




MM. 


Vitamin biosynthesis 




NN. 


Steroid and terpenoid biosynthesis 


30 


00. 


Devise secondary metabolite synthesis 




PP. 


Co-enzyme metabolism 




QQ. 


Flavonoid biosynthesis and degradation 




RR. 


Synthesis of waxes 




Attorney UkX. ZOO-MMF 
Client Dkt. 80090.005 



Page 225 of 772 



Attorney Ukt. 2 /MM48JF 
Client Dkt. 80090.005 



SS. Glyoxylate metabolism 

TT. Hormone perception and response pathways 



Uses of Plants that Are Modified as Described above 

Altering leaf genes or gene products in a plant modifies one or more the following 
plant traits, to make the plants more useful for specific purposes in agriculture, horticulture 
and for the production of valuable molecules. The useful plants have at least one of the 
following: 



A. More seedling vigor 

B. A higher yield of early leaves and their molecular constituents due to different: 

6. Number, size, weight, harvest index 

7. Composition including and amounts and types of 

carbohydrates, proteins, oils, waxes, etc. 

8. Photosynthetic efficiency e.g. reduced photorespiration 

9. Absorption of water and nutrients to enhance yields, including 

under stresses such as high light, herbicides, and heat. 

10. Pathways to accumulate new valuable molecules. 

I. More optimal leaf shape and architecture in early seedling- enhancing 
photosynthesis and enhancing appeal in ornamental species 

(e) size 

(f) number 

(g) pigment 

J. A better overall plant architecture - enhancing photosynthesis and 

enhancing appeal in ornamental species 
K. Reduced negative effects of high planting density, by altering leaf placement 

to be more vertical instead of parallel to the ground, for instance 

L. Better stress tolerance, including without limitation 

3. Drought resistance, by decreasing water loss, for example 

4. Pathogen resistance 
M. Better overall yield and vigor 
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Plant yield of biomass and of constituent molecules and plant vigor are 
modulated to create benefits by genetically changing: 
3. Growth rate of 

(h) Seedling 

(i) Coleoptile elongation 
(j) Young leaves 

To change any of the phenotype(s) above, activities of one or more of the early 
seedling phase genes or gene products are modulated in an organism and the consequence 
evaluated by screening for the desired trait. Specifically, the gene, mRNA levels, or protein 
levels are altered in a plant utilizing the procedures described herein and the phenotypes can 
be assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(Methods. Mol. Biol. 82:259-266 (1998)) with leaf gene constructs and/or screened for 
variants as in Winkler et al, Plant Physiol. 118: 743-50 (1998) and visually inspected for the 
desired phenotype and metabolically and/or functionally assayed for altered levels of relevant 
molecules. 

D.5.C. USE OF EARLY SEEDLING PHASE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
Seedlings are complex and their structure, function and properties result from the 
integration of many processes and biochemical activities. Some of these are known from the 
published literature and some can be deduced from the genes and their products described in 
this application. Early seedling phase genes, and gene components are used singly or in 
combination to modify these processes and biochemical activities and hence modify the 
phenotypic and trait characteristics described above. Examples of the processes and 
metabolic activities are given in the Table below. The resulting changes are measured 
according to the citations included in the Table. 



PROCESS 


BIOCHEMICAL OR 


CITATIONS 




METABOLIC 


INCLUDING ASSAYS 




ACTIVITIES 






AND/OR 






PATHWAYS 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS 




METABOLIC 


INCLUDING ASSAYS 




ACTIVITIES 






AND/OR 






PATHWAYS 




Metabolism - anabolic 


G. Farnesyl 


Pei et al., Science 282: 287-290 


and catabolic 


ation 


(1998); Cutler et al., Science 273: 




H. Cell 


1239 (1996) 




Wall 


Goupil et al, J Exptl. Botany 




Biosynt 


49:1855-62(1998) 




hesis 


Walch-Liu et al., J Exppt. Botany 




I. Nitrogen 


51, 227-237 (2000) 




Metabol 






ism 






J. Seconda 












Metabol 






ite 






Biosynt 






hesis 






and 






Degrada 






tion 




Water Conservation And 


A. Production of 


Allen et al., Plant Cell 1 1 : 1785- 


Resistance To Drought 


polyols 


1798 (1999) 


And Other Related 


B. Regulation of 


Li et al., Science 287: 300-303 


Stresses 


salt 


(2000) 




concentration 


Burnett et al, J Exptl. Botany 51: 




C. ABA 


197-205 (2000) 




response(s) 


Raschke, In: Stomatal Function, 
Zeiger et al Eds., 253-279 (1987) 
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PROCESS 




BIOCHEMICAL OR 


CITATIONS 






METABOLIC 


INCLUDING ASSAYS 






ACTIVITIES 








AND/OR 








PATHWAYS 










(i) Ca2+ 


Lacombe et al., Plant Cell 12: 837- 


Transport Anion and 






Accumu 


51 (2000); 


Cation Fluxes 






lation 


Wang et al., Plant Physiol. 






(a) 


K+ Fluxes 


118:1421-1429(1998); 






(b) 


Na+ Fluxes 


Shietal., Plant Cell 11:2393- 






1. 


Receptor - 


2406 (1999) 








ligand binding 


Gaymard et al., Cell 94:647-655 






2. 


Anion and 


(1998) 








Cation fluxes 


Jonak et al., Proc. Natl. Acad. Sci. 










93: 11274-79 (1996); 










Sheen, Proc. Natl. Acad. Sci. 95: 










975-80 (1998); 










Allen et al., Plant Cell 11: 1785-98 










(1999) 


Carbon Fixation 




3. 


Calvin Cycle 


Wingler et al., Philo Trans R Soe 




5. 




Photorespiratio 


Lond B Biol Sci 355, 1517-1529 






n 




(2000); 




6. 




Oxygen 








evolution 


Palecanda et al., Plant Mol Biol 




7. 




RuBisCO 


46, 89-97 (2001); 






4. 


Chlorophyll 


Baker et al., J Exp Bot 52, 615- 








metabolism 


621 (2001) 








(ii) Chloropl 










ast 


Chen et al., Acta Biochim Pol 41, 








Biogene 


447-457 (1999) 








sis and 


Imlau et al., PlantCell H, 309-322 








Metabol 


(1999) 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS 




METABOLIC 


INCLUDING ASSAYS 


• 


ACTIVITIES 








AND/OR 








PATHWAYS 












ism 






5. 


Fatty Acid and 








Lipid 










Biosynthesis 








(iii) 


Glyoxyl 










ate 




i 






metaboli 










sm 








(iv) 


Sugar 










Transpo 










rt 








(v) 


Starch 










Biosynt 










hesis 










and 










Degrada 










tion 




Hormone Perception and 




(vi) 


Hormon 




Growth 






e 










Recepto 


Tieman et al., Plant J 26, 47-58 








rs and 


(2001) 








Downstr 


Hilpert et al., Plant J 26, 435-446 








earn 


(2001) 








Pathway 










s for 


Wenzel et al., Plant Phys 124, 




(a) 


ethylene 


813-822 (2000) 




(b) 


jasmonic acid 


Dengler and Kang, Curr Opin 
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PROCESS 


BIOCHEMICAL OR 

METABOLIC 

ACTIVITIES 

AND/OR 

PATHWAYS 


CITATIONS 
INCLUDING ASSAYS 




(c) brassinosteroid 

(d) gibberellin 

(e) auxin 

(f) cytokinin 

- Activation Of Specific 
Kinases And 
Phosphatases 


Plant Biol 4, 50-56 (2001) 
Tantikanjana et al., Genes Dev 15, 
1577-1580(2001) 


See Imbibition, Shoot Apical Meristem, Root and Leaf sect 


ions for more details 



Other biological activities that are modulated by the leaf genes and gene products are 
listed in the Reference tables. Assays for detecting such biological activities are described in 
the Protein Domain table, for example. 



D.5.d. USE OF EARLY SEEDLING PHASE GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE TRANSCRIPTION LEVELS 
The expression of many genes is "up regulated" or down regulated" in plants because 
some genes and their products are integrated into complex networks that regulate 
transcription of many other genes. Some early seedling phase genes, gene components and 
products are therefore useful for modifying the transcription of other genes and hence 
complex phenotypes, as described above. Profiles of leaf gene activities are described in the 
Table below with associated biological activities. "Up-regulated" profiles are those where 
the mRNA transcript levels are higher in young seedlings as compared to the sterilized seeds. 
"Down-regulated" profiles represent higher transcript levels in the plantlet as compared to 
sterilized seed only. 
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I.D.3. SIZE AND STATURE GENES, GENE COMPONENTS AND 
PRODUCTS 

Great agronomic value can result from modulating the size of a plant as a whole or of 
any of its organs. For example, the green revolution came about as a result of creating dwarf 
wheat plants, which produced a higher seed yield than taller plants because they could withstand 
higher levels and inputs of fertilizer and water. Size and stature genes elucidated here are 
capable of modifying the growth of either an organism as a whole or of localized organs or cells. 
Manipulation of such genes, gene components and products can enhance many traits of 
economic interest from increased seed and fruit size to increased lodging resistance. Many 
kinds of genes control the height attained by a plant and the size of the organs. For genes 
additional to the ones in this section other sections of the Application should be consulted. 

a) Identification of Size and Stature Genes, Gene Components and 
Products 

Size and stature genes identified herein are defined as genes, gene components and 
products capable of modulating one or more processes in growth and development, to 
produce changes in size of one or more organs. Examples of such stature genes and gene 
products are shown in the Reference, Sequence, Protein Group, Protein Group Matrix, 
Knock-in, Knock-out, MA-diff and MA-clust. The biochemical functions of the protein 
products of many of these genes determined from comparisons with known proteins are also 
given in the Reference tables. 

b) Size And Stature Genes, Gene Components And Products 
Identified By Phenotypic Observations 

Mutant plants exhibiting increased or decreased stature in comparison to parental 
wild-type plants were used to identify size and stature genes. In these experiments, size and 
stature genes were identified by either (1) the ectopic expression of a cDNA in a plant or (2) 
mutagenesis of the plant genome. The plants were then cultivated and stature genes were 
identified from plants that were smaller than the parental 4 Vild-type", The phenotypes and 
gene mutations associated with them are given in Tables 

Examples of phenotypes, biochemical activities, or transcript profiles that are 
modulated using these genes are described above and below. 
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c) Use Of Size And Stature Genes, Gene Components And 

Products To Modulate Phenotypes 
Typically, these genes can cause or regulate cell division, rate and time; and also cell 
size and shape. Many produce their effects via meristems. These genes can be divided into 
three classes. One class of genes acts during cytokinesis and/or karyokinesis, such as mitosis 
and/or meiosis. A second class is involved in cell growth; examples include genes regulating 
metabolism and nutrient uptake pathways. Another class includes genes that control 
pathways that regulate or constrain cell division and growth. Examples of these pathways 
include those specifying hormone biosynthesis, hormone sensing and pathways activated by 
hormones. 

Size and stature genes and gene components are useful to selectively alter the size of 
organs and stems and so make plants specifically improved for agriculture, horticulture and 
other industries. There are a huge number of utilities. For example, reductions in height of 
specific ornamentals, crops and tree species can be beneficial, while increasing height of 
others may be beneficial. 

Increasing the length of the floral stems of cut flowers in some species would be 
useful, while increasing leaf size in others would be economically attractive. Enhancing the 
size of specific plant parts, such as seeds, to enhance yields by stimulating hormone 
(Brassinolide) synthesis specifically in these cells would be beneficial. Another application 
would be to stimulate early flowering by altering levels of gibberellic acid in specific cells. 
Changes in organ size and biomass also results in changes in the mass of constituent 
molecules. This makes the utilities of size and stature genes useful for the production of 
valuable molecules in parts of plants, for extraction by the chemical and pharmaceutical 
industries. 

Examples of phenotypes that can be modulated by the genes and gene components are 
described above and below: 

I. Cellular Level: 

Size and stature genes and gene products can be used to modulate cellular changes in: 

A. Cell size 

B. Cell shape 

C. Cell division, rate and direction 
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D. Cell elongation 

E. Cell differentiation 

F. Stomata number 

G. Trichome number 



Organ Level 

The genes of the invention are useful to regulate the development and growth of: 

A. Roots 

1. Primary 

2. Lateral 

3. Root hairs 

4. Root cap 

5. Apical meristem 

6. Epidermis 

7. Cortex 

8. Stele 

B. Stem 

1. Pholem 

2. Xylem 

3. Nodes 

4. Internodes 

5. Shoot apical meristem 

C. Leaves 

1. Cauline 

2. Rosette 

3. Petioles 

D. Flowers 

1. Receptacle 

2. Sepals, Petals, and Tepals 

(a) Color 

(b) Shape 

(c) Size 

(d) Number 
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(e) Petal Drop 

3. Androecium 

4. Stamen 

5. Anther 

6. Pollen 

7. Sterility 

8. Size, shape, weight, color 

9. Filament 

10. Gynoecium 

11. Carpel 

12. Ovary 

13. Style 

14. Stigma 

15. Ovule 

(a) Size, shape, number 

1 6. Pedicel and Peduncle 

17. Flowering time 

18. Fertilization 
Seeds 

1. Placenta 

2. Embryo 

3. Cotyledon 

4. Endosperm 

5. Suspensor 

6. Seed coat (testa) 
Fruits 

1. Pericarp - thickness, texture 

2. Exocarp 

3. Mesocarp 

4. Endocarp 
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Overall Organism Level 

The following traits can be modulated with the genes and gene products of this invention 
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to affect the traits of a plant as a whole: 


A. 


Architecture 




1. 


Branching 




2. 


Ornamental architecture 


5 


3. 


Shade avoidance 




4. 


Planting density effects 




5. 


Wind resistance 


B. 


Vigor 






1. 


Increased biomass 


10 


2. 


Drought tolerance 



' To regulate any of the phenotype(s) above, activities of one or more of the sizing 
genes or gene products are modulated in an organism and tested by screening for the desired 
D trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing 
£5 the procedures described herein and the phenotypes can be assayed. As an example, a plant 
E can be transformed according to Bechtold and Pelletier (Methods. MoL Biol. 82:259-266 
fU (1998)) and/or screened for variants as in Winkler et al. 5 (Plant Physiol. 118: 743-50 1998) 
gi and visually inspected for the desired phenotype or metabolically and/or functionally assayed. 

n 

§0 D.3.b. USE OF SIZE AND STATURE GENES, GENE COMPONENTS 

T A ND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 

"3* — 

rf Many metabolic and developmental processes can be modulated by size and stature 

genes and gene components to achieve the phenotypic characteristics exemplified above. 
Some of these are listed below. Such biological activities can be measured according to the 

25 citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth and Development 


Gibberellic Acid Biosynthesis 
Gibberellic Acid Receptor and 
Downstream Pathways 


Swain SM, Tseng Ts, 
Olszewski NE. Altered 
expression of spindly affects 
gibberellin response and plant 
development. Plant Physiol 
2001 Jul;126(3):l 174-85 
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PROCESS 



BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 



CITATIONS INCLUDING 
ASSAYS 



Hooley, R. Gibberellins: 
perception, transduction, and 
responses. Plant Mol. Biol. 
1994 26:1529-1555. 



Hooley, R. Gibberellins: 
perception, transduction, and 
responses. Plant Mol. Biol. 
1994 26:1529-1555. 



Brassinolide Biosynthesis 
Brassinolide Receptors, 
Degradation of Brassinolide 
Pathways affected by 
Brassinolide 



Perata, P, Matsukura, C, 
Vernieri, P, Yamaguchi, J, 
Sugar repression of a 
gibberellin-dependent 
signaling pathway in barley 
embryos. Plant Cell 1997 
9:2197-2208. 

Noguchi T, Fujioka S, Choe S, 
Takatsuto S, Tax FE, Yoshida 
S, Feldmann KA. Biosynthetic 
pathways of brassinolide in 
Arabidopsis. Plant Physiol 
2000 Sep;124(l):201-9 

Wang ZY, Seto H, Fujioka S, 
Yoshida S, Chory J. BRI1 is a 
critical component of a 
plasma-membrane receptor for 
plant steroids. Nature 2001 
Mar 15;410(6826):380-3 

Neff MM, Nguyen SM, 
Malancharuvil EJ, Fujioka S, 
Noguchi T, Seto H, Tsubuki 
M, Honda T, Takatsuto S, 
Yoshida S, Chory J. BAS1: A 
gene regulating brassinosteroid 
levels and light responsiveness 
in Arabidopsis. Proc Natl Acad 
SciUS A 1999 Dec 
21;96(26):15316-23 

Kang JG, Yun J, Kim DH, 
Chung KS, Fujioka S, Kim JL 
Dae HW, Yoshida S, 
Takatsuto S, Song PS, Park 
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PROCESS 



BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 



CITATIONS INCLUDING 
ASSAYS 



Cytokinin biosynthesis 
Cytokinin receptor 
Degradation of Cytokinin 
Pathways affected by Cytokinin 



Auxin Biosynthesis 
Auxin Receptor 
Auxin Degradation 
Pathways affected by Auxins 
Auxin transport 



CM. Light and brassinosteroid 
signals are integrated via a 
dark-induced small G protein 
in etiolated seedling growth. 
Cell 2001 Jun l;105(5):625-36 

Mok DW, Mok MC. Cytokinin 
metabolism and action. Annu 
Rev Plant Physiol Plant Mol 
Biol 2001;52:89-118 

Schmulling T. CREam of 
cytokinin signalling: receptor 
identified. Trends Plant Sci 
2001 Jul;6(7):281-4 

Mok DW, Mok MC. Cytokinin 
metabolism and action. Annu 
Rev Plant Physiol Plant Mol 
Biol 2001;52:89-118 

Seyedi M, Selstam E, Timko 
MP, Sundqvist C. The 
cytokinin 2-isopentenyladenine 
causes partial reversion to 
skotomorphogenesis and 
induces formation of 
prolamellar bodies and 
protochlorophyllide657 in the 
lipl mutant of pea. Physiol 
Plant 2001 Jun;112(2):261-272 

Zhao Y, Christensen SK, 
Fankhauser C, Cashman JR, 
Cohen JD, Weigel D, Chory J. 
A role for flavin 
monooxygenase-like enzymes 
in auxin biosynthesis. Science 
2001 Jan 12;291(5502):306-9 

Abel S, Ballas N, Wong LM, 
Theologis A. DNA elements 
responsive to auxin. Bioessays 
1996Aug;18(8):647-54 

del Pozo JC, Estelle M. 
Function of the ubiquitin- 
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PROCESS 



BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 



CITATIONS INCLUDING 
ASSAYS 



proteosome pathway in auxin 
response. Trends Plant Sci 
1999 Mar;4(3):107-112. 

Rahman A, Amakawa T, Goto 
N, Tsurumi S. Auxin is a 
positive regulator for ethylene- 
mediated response in the 
growth of Arabidopsis roots. 
Plant Cell Physiol 2001 Mar; 
42(3):301-7 

Zhao Y, Christensen SK, 
Fankhauser C, Cashman JR, 
Cohen JD, Weigel D, Chory J. 
A role for flavin 
monooxygenase-like enzymes 
in auxin biosynthesis. Science 
2001 Jan 12;291(5502):306-9 

Abel S, Ballas N, Wong LM, 
Theologis A. DNA elements 
responsive to auxin. Bioessays 
1996 Aug;18(8):647-54 

del Pozo JC, Estelle M. 
Function of the ubiquitin- 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Cell wall growth 


Estelle M., Polar auxin 
transport. New support for an 
old model. Plant Cell 1998 
Nov;10(ll): 1775-8 

Cosgrove DJ., Loosening of 
plant cell walls by expansins. 
Nature 2000 Sep 
21;407(6802):321-6 



Other biological activities that are modulated by the stature genes and gene products 
are listed in the Reference tables. Assays for detecting such biological activities are 
5 described in the Protein Domain table, for example. 
^ Changes in the size, vigor, or yield of a plant are the result of modulation of the 

C 5 activities of one or more of these many size and stature genes and gene products. While size 
S and stature polynucleotides and gene products can act alone, combinations of these 
p»j polynucleotides and also with others that also affect growth and development are especially 
ffO useful. 

E Use of Pro moters of "Size and Stature" Genes 

n ■ ^ — — — ^ ™^ — 

3 (J 

J* Promoters of "size and stature" genes are useful for controlling the transcription of 

any desired polynucleotides, both plant and non-plant. They can be discovered from the "size 

15 and stature" genes in the Reference Tables, and their patterns of activity from the MA Tables. 
When operably linked to any polynucleotide encoding a protein, and inserted into a plant, the 
protein will be synthesized in those cells in which the promoter is active. Many "size and 
stature" genes will function in meristems, so the promoters will be useful for expressing 
proteins in meristems. The promoters can be used to cause loss of, as well as synthesis of, 

20 specific proteins via antisense and sense suppression approaches. 
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I.D.4. SHOOT-APICAL MERISTEM GENES, GENE COMPONENTS AND 



PRODUCTS 



New organs, stems, leaves, branches and inflorescences develop from the stem apical 
meristem (SAM). The growth structure and architecture of the plant therefore depends on the 
behavior of SAMs. Shoot apical meristems (SAMs) are comprised of a number of 
morphologically undifferentiated, dividing cells located at the tips of shoots. SAM genes 
elucidated here are capable of modifying the activity of SAMs and thereby many traits of 
economic interest from ornamental leaf shape to organ number to responses to plant density. 



SAM genes identified herein are defined as genes, gene components and products 
capable of modulating one or more processes or functions of SAMs as described below. 
Regulation of SAM genes and gene products are useful to control many plant traits including 
architecture, yield and vigor. Examples of such SAM genes and gene products are shown in the 
Reference, Sequence, Protein Group, Protein Group Matrix, phenotype and MA-diff Tables. 
The functions of many of the protein products of these genes are also given in the Reference 
tables. 

Sam Genes, Gene Components And Products Identified By Phenotypic Observations 
SAM genes were discovered and characterized from a much larger set of genes by 
experiments designed to find genes that cause phenotypic changes in leaf morphology, such as 
cotyledon or leaf fusion. In these experiments, SAM genes were identified by either (1) ectopic 
expression of a cDNA in a plant or (2) mutagenesis of the plant genome. The plants were then 
cultivated and one or more of the following phenotypes, which varied from the parental 'Svild- 
type", was observed: 



a) 



IDENTIFICATION OF SAM GENES, GENE COMPONENTS 



AND PRODUCTS 



I. 



Cotyledon 



Fused 



n. 



Leaves 



Fused 



Leaf placement on stems 



Page 241 of 772 



m. Branching 

Number 
IV. Flowers 

Petals fused 

5 - Altered bolting 

Early bolting 
Late bolting 
Strong bolting 
Weak bolting 

10 - Abnormal branching 

For more experimental detail see the Example section below. The genes identified by 
these results of the phenotypes that are shown in Knock-in and Knock-out Tables. 

Jj> Sam Genes, Gene Components And Products Identified By Differential Expression 

JT SAM genes were also identified in experiments designed to find genes whose mRNA 

W products are associated specifically or preferentially with SAMs. The concentration of mRNA 

products in the arabidopsis plant with the SHOOTMERISTEMLESS (STM) gene knocked-out 
5 was measured relative to the concentration in the parental, non-mutant plant. The Arabidopsis 

STM gene is required for embryonic SAM formation. The STM gene encodes a Knottedl 

n i 

^ (Knl) type of homeodomain protein. Homeodomain proteins regulate transcription of many 

p genes in many species and have been shown to play a role in the regulation of translation as 

M 

well. Seedlings homozygous for recessive loss-of-function alleles germinate with roots, a 
hypocotyl, and cotyledons, but no SAM is formed. The MA_diff Table(s) reports the transcript 

25 levels of the experiment (see EXPT ID: 108478, 108479, 108480, 108481, 108598, 108535, 

108536, 108435). For transcripts that had higher levels in the samples than the control, a "+" is 
shown. A "-" is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Meristem genes are those sequences that showed differential expression as compared to 

30 controls, namely those sequences identified in the MA_diff tables with a or "-" indication 

Meristem Genes Identified By Cluster Analyses Of Differential Expression 
Meristem Genes Identified By Correlation To Genes That Are 
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Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Meristem genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108478, 108479, 108480, 108481, 108598, 108535, 
108536, 108435 oftheMA_diff table(s). 

Meristem Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Meristem genes. A group in the MA_clust is 
considered a Meristem pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Meristem Genes Identified By Amino Acid Sequence Similarity 
Meristem genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis Meristem genes. Groups of 
Meristem genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Meristem pathway or 
network is a group of proteins that also exhibits Meristem functions/utilities. 

Examples of phenotypes, biochemical activities, and transcription profiles that 
can be modulated by SAM genes and gene products are described above and below. 

b) USE OF SAM GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 
With the SAM genes and gene products of the invention, Applicants provide the 
means to modulate one or more of the following types of SAMs: 

1 . Embryonic meristem 

2. Vegetative lateral SAMs 
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3. Inflorescence lateral SAMs 

4. Floral meristems 

5. Adventitious SAM 



The SAM genes of the instant invention are useful for modulating one or more 
processes of SAM structure and/or function including (I) cell size and division; (II) cell 
differentiation and organ primordia. 

L Cell Size and Division 

A. Cell Properties 

SAM genes and gene products can be used to modulate changes in: 

1. Cell size 

2. Cell division, rate and direction 

3 . Cell division symmetry 

A key attribute of the SAM is its capacity for self-renewal. The self-renewing initial cell 
population resides in the central zone of the SAM. A small number of slowly dividing initial 
cells (typically 2 to 4 per layer) act as a self-replenishing population, whereas some of their 
descendants, pushed out onto the flanks of the SAM, differentiate into leaves. Other 
descendants, displaced below the SAM, differentiate into stem. The immediate descendants of 
the initial cells divide further, amplifying the cell population before being incorporated into leaf 
or stem primordia. 

The genes and gene components of this invention are useful for modulating any one or 
all of these cell division processes generally, as in timing and rate, for example. In addition, the 
polynucleotides and polypeptides of the invention can control the response of these processes to 
the internal plant programs associated with: 

1. Embryogenesis 

2. Hormone responses 

Cytokinin (inhibitory for root development, see section on cytokinin- 
responsive genes) 

3. Coordination of growth and development with that of other plant 
organs 

a. Leaves 

b. Flowers 
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c. Seeds 

d. Fruits 

SAM genes can also be used to control the response of these processes to changes in the 
environment, including heat, cold, drought, high light and nutrition. 

B. Sam Cell Patterns And Organization 
Although SAMs appear as small regions of morphological undifferentiated dividing 
cells, a group of morphologically undifferentiated dividing cells does not necessarily constitute a 
SAM. Rather, evidence indicates that SAMs are highly organized or patterned regions of the 
plant in which many important events in early organogenesis occur. Thus, the term "SAM" is 
used to denote a highly organized structure and site of pattern formation. The invention also 
permits engineering of specific as well as overall features of SAM architecture as follows: 

1. Zones 

a. Central 

b. Peripheral 

c. Rib 

2. Layers 

a. LI 

b. L2 

c. L3 

3. Symmetry 

II Cell Differentiation And Organ Primordia 

The apical meristem in many species first undergoes a vegetative phase whereby cells 
set aside from the apex become leaf primordia with an axillary vegetative meristem. Upon floral 
induction, the apical meristem is converted to an inflorescence meristem. The inflorescence 
meristem arises in the axils of modified leaves and is indeterminate, producing whorls or rings 
of floral organ primordia. In species which produce terminal flowers, the apical meristem is 
determinate and eventually adopts a third identity, that of a floral meristem. Examples of the 
plant properties that the genes and gene products of the invention can be used to modulate 
include: 

1. Indeterminancy 

a. Inhibiting or increasing differentiation 
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b. Enhancing plant growth and yield 



2. Symmetry 

a. Symmetry of organs developed 

b. Symmetry of arrangement of organs, such as leaves, petals, 
flowers, etc. 

3. Leaf fate and timing (for more detail see leaf section) 

a. Internode length modulation 

i. Longer internodes to increase shade avoidance 

ii. Shorter internodes to favor leaf development 

4. Floral fate and timing of flowering (for more detail see the reproduction 
section) 

Uses Of Plants Modified As Described Above Using SAM Genes, Gene Components 
And Products 

Because SAMs determine the architecture of the plant, modified plants will be useful in 
many agricultural, horticultural, forestry and other industrial sectors. Plants with a different 
shape, numbers of flowers and seed and fruits will have altered yields of plant parts. For 
example, plants with more branches can produce more flowers, seed or fruits. Trees without 
lateral branches will produce long lengths of clean timber. Plants with greater yields of specific 
plant parts will be useful sources of constituent chemicals. Such plants will have, for example: 

A. More prolific leaf development (see Leaf section for more detail) 

B. Better optimized stem and shoot development (see Stem section for more detail) 

C. Adventitious shoots 

D. More flowers, seeds, and fruits (see Reproduction section for more detail) 

E. Enhanced vigor, including growth rate of: 



1. 


Whole plant, including height, flowering time, etc. 


2. 


Seedling 


3. 


Coleoptile elongation 


4. 


Young leaves 


5. 


Flowers 


6. 


Seeds 


7. 


Fruit 
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F. Higher yields based on: 



1. 



Biomass 



Fresh and dry weight during any time in plant life, including 



maturation and senescence 



Number of flowers 



Seed yield 

a. Number, size, weight, harvest index 

b. Content and composition, e.g. amino acid, jasmonate, oil, 



protein and starch 



Fruit yield 

a. Number, size, weight, harvest index 

b. Content and composition, e.g. amino acid, jasmonate, oil, 



protein and starch 



To regulate any of the phenotype(s) above, activities of one or more of the SAM 
genes or gene products can be modulated and tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing the 
procedures described herein and the phenotypes can be assayed. As an example, a plant can 
be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) 
and/or screened for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and 
visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Dolan et al. (1993, Development 119: 71-84), Dolan et al. (1997, Development 
124: 1789-98), Crawford and Glass (1998, Trends Plant Science 3: 389-95), Wang et al. 
(1998, PNAS USA 95: 15134-39), Gaxiola et al. (1998, PNAS USA 95: 4046-50), Apse et al. 
(1999, Science 285: 1256-58), Fisher and Long (1992, Nature 357: 655-60), Schneider et al. 
(1998, Genes Devel 12: 2013-21) and Hirsch (1999, Curr Opin Plant Biol. 2: 320-326). 



SAM genes and gene components are useful for modulating biochemical or metabolic 
activities and/or pathways such as those noted below. Such biological activities can be 
measured according to the citations included in the Table below: 



c) USE OF SAM GENES AND GENE COMPONENTS TO 
MODULATE BIOCHEMICAL ACTIVITIES 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , Differentiation 
And Development 

c 


Leaf shape and inflorescence and 
flower morphology systems 
Activities of SAM 
transcriptional regulatory 
proteins. 


Chuck, G. et al, 1996 Plant Cell 
8: 1227-1289. 
Schneeberger et al., 1998 
Development 125: 2857-2865. 




Meristem size and organ number 
determinants 

- Regulated by Receptor Kinases 

- Receptor kinase location and 
activity. 


- Kayes, J.M. and Clark, S.E. 
1998 Development 125: 3843- 
3851. 

- Jeong, S. et al., 1999 Plant 
Cell 11: 1925-1934. 




Meristem proliferation activities 


Tantikanjana, T. Genes and 
Development. June 15, 2001. 
15(12):1577-1588. 


Internode elongation 


Hormone signaling pathways 


Yamamuro, C. et al., 2000 Plant 
Cell. 12: 1591-1605. 


Hormone 
Perception 


Levels of growth hormones 
including gibberellic acid, Auxin 
and cytokinin. 


Kusaba, S. et al; 1998 Plant 
Physiology 1 1 6(2) :47 1-476. 




Gibberellic acid biosynthesis 
GA biosynthetic enzyme GA-20 
oxidase is a required step in GA 
biosynthesis. GA-20 oxidase is 
Regulated by some SAM gene 
products. 


Modulation of GA perception 
and function can be assayed as 
described in Sakamoto, T. et al. 
2001 Genes and Development 
15:581-590. 




Over expression of SAM genes 
can lead to reduced internode 
elongation, reduced cell 
elongation and reduced cell 


Sakamoto, T. et al. 2001. 
Genes and Development 15: 
581-590. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




expansion. 






Cytokinin Receptor activity 


Inoue, T. et al., Nature 
409:1060-1063. 




SAM gene products can affect 
the activity of Auxin dependent 
postranscriptional gene protein 
expression. 


Sieberer, T. et al., 2000 Current 
Biology 10:1595-1598. 
del Pozo, J. C.;Estelle, M. 
PNAS (USA) 1999. 
96(26): 15342-1 5347. 




SAM gene products can affect 
Auxin Perception/metabolism in 
the meristem to produce useful 
changes in plant architecture. 


Tantikanjana, T. Genes and 
Development. June 15, 2001. 
15(12):1577-1588. 


Leaf senescence 


SAM gene products can increase 
and decrease leaf senescence 
rate. This can be done by 
modulating cytokinin hormone 
levels. 


Ori, N. et al; Plant Cell. June, 
1999. 11(6):1073-1080. 




Cytokinin effect on cell division 
and expansion. 


Beemster, Gerrit T. S.; Baskin, 
Tobias I. 2000 Plant Physiology 
124:1718-1727. 


Adventitious shoot 
formation 


Alter growth hormone status. 


Kusaba,S. etal; 1998 Plant 
Physiology 116(2):471-476 




Ectopic expression of SAM 
genes in leaf or other non SAM 
organs or tissue can produce 
shoots 


Chuck, G. 1996 Plant Cell 8: 
1227-1289. 




Pathways comprising 
isopentenyl transferase (ipt) 
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Other biological activities that can be modulated by the SAM genes and gene products 
are listed in the Reference tables. Assays for detecting such biological activities are described in 
the Protein Domain table. 



5 d) USE OF SAM GENES, GENE COMPONENTS AND 

PRODUCTS TO MODULATE TRANSCRIPTION LEVELS OF 
OTHER GENES 

The expression of many genes is "upregulated" or "downregulated" in the SAM mutants 
because some of the SAM genes are integrated into complex networks that regulate the 

10 transcription of many other genes. Some SAM genes and gene components are therefore useful 
for modifying the transcription of other genes and hence complex phenotypes as described 
above. Profiles of genes altered by SAM mutations and genes are described in the Table below 
with associated biological activities. "Up-regulated" profiles are for genes whose mRNA levels 

C are higher in the stm plants as compared to parental wild-type plants; and vice-versa for "down- 

TSST 

JS regulated" profiles. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 








TRANSCRIPTS ARE 


MODIFYING SAM 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 


Up Regulated 


Genes repressed by 


• Altered 


• Transporters 


Transcripts 


SAMs directly or 


Auxin/cytokinin 


• Metabolic Enzymes 




indirectly 


hormone ratio and 


• Cell Membrane 






perception. 


Structure 






• Increased/decreased 


• Kinases, Phosphatases, 






cell expansion - 


G-Proteins 






promoting effects of 


• Transcription 






brassinosteroids and 


Activators/Repressors 






gibberellic acids, due 


• Transcription 






to altered levels of 


coactivators/corepressor 






biosynthetic pathway 


s 






enzymes and or the 


• Chromatin Structure 






amount of functional 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES OF 


BIOCHEMICAL 




TRANSCRIPTS ARE 


MODIFYING SAM 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 






hormone receptor. 


And/Or Localized DNA 






• Increased or 


Topology Proteins 






decreased rate of cell 


• Cell Wall Proteins 






division. 


• Translational 






• Altered planes of cell 


activators/repressors 






division 


• Cell wall proteins 






• Increased or 


involved in cell rigidity 






decreased rate and 


e.g. extensin, glycine 






extent of cell 


rich proteins. 






expansion. 


• Cell cycle regulatory 






• Increased or 


proteins such as cyclins 






decreased rigidity of 


and cyclin dependent 






cell ways. 


protein kinases (CDKs). 


Down-Regulated 


Genes involved in SAM 


• Altered pattern of 


• Auxin transporter 


Transcripts 


cells and genes whose 


organs immerging 


proteins 




expression is induced by 


from the meristem 


• Auxin receptor proteins 




SAMs 


• Increased or 


• Cytokinin receptor 






decreased the number 


proteins 






of cells partitioned 


• Gibberellic acid receptor 






into a lateral organ. 


proteins 






• Altered apical 


• Brassinolide receptor 






dominance due to 


proteins 






suppression of lateral 


• Hormone biosynthesis 






bud growth. 


proteins 






• Altered apical 


• Hormone degradation 






dominance due to 


proteins 






releasing of axillary 


• Hormone conjugation 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES OF 


BIOCHEMICAL 




TRANSCRIPTS ARE 


MODIFYING SAM 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 






meristems from 


proteins 






repression. 


• Ubiquitin conjugating 






• Increased/or 


enzymes. 






decreased production 


• Receptor kinase signal 






of adventitious 


transduction 






meristems. 








• Increased potential to 








form somatic 








embryos. 








• Altered cell signaling 








pathways 








• Altered hormone 








levels 





SAM genes and gene products can be modulated alone or in combination as described in 
the introduction. Of particular interest are combination of these genes and gene products with 
those that modulate hormone responsive pathways. Hormone responsive genes and gene 
products are described in more detail in the sections below. 



Use Of Sam Gene Promoters To Modify Sams 

Promoters of SAM genes, as described in the Reference tables, for example, can be used 
to modulate transcription of coding sequences in SAM cells to influence growth, differentiation 
or patterning of development or any of the phenotypes or biological activities above. For 
example, any desired sequence can be transcribed in similar temporal, tissue, or environmentally 
specific patterns as a SAM gene when the desired sequence is operably linked to the promoter of 
the SAM gene. 

A specific instance is linking of a SAM gene promoter normally active in floral 
meristem primordia, to a phytotoxic protein coding sequence to inhibit apical meristem 
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switching into an inflorescence and/or floral meristem, thereby preventing flowering. 

SAM gene promoters can also be used to induce transcription of antisense RNA copies 
of a gene or an RNA variant to achieve reduced synthesis of a specific protein in specific SAM 
cells. This provides an alternative way to the example above, to prevent flowering. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS 


OF MODIFYING 


ACTIVITIES OF 




ARE CHANGED 


GENE PRODUCT 


GENE PRODUCTS 






LEVELS 


WITH MODIFIED 








T 1 7T71T O 

LEVELS 


Up regulated 


Genes involved in 


• Leaf cells 


• Transcription 


Transcripts 


leaf, stem and root 


proliferate and 


factors, signal 




cell differentiation, 


differentiate; 


A J A* 

transduction 




1 1 j • * * ii 

cell division, cell 


• Leaf structures 


proteins, kinase 




expansion 


-T J J 

form and expand 


and phosphatases 








• Chromatin 




Genes involved in 




remodeling 




positive regulation of 




• Hormone 




root, stem and leaf 




biosynthesis 




genes 




enzymes 








• Receptors 




Repressors of root 








and other organ cell 








types e.g. flowers 








Genes involved in 


• Photosynthesis 


• Light harvesting 




photosynthesis 


and plastid 


coupled to ATP 






differentiation 


production 






• Calvin cycle 


• Chlorophyll 






activated 


1 * . i • 
biosynthesis 






• Chloroplast 


• Ribulose 






biogenesis and 


Bisphosphate 






1 A* J 

plastid 


carboxylase 






differentiation 


• Chloroplast 






activated 


membranes 








synthesis 








• i^nioropiasi 








ribosome 








biogenesis 




Other genes involved 


• Starch 


• Starch synthase 




in metabolism 


biosynthesis 


• Nitrate reductase 






• Lipid 


• Terpenoid 






biosynthesis 


biosynthesis 






• Nitrogen 


• Transcription 






metabolism - 


factors 






N03 reduced and 


• Transporters 






amino acids made 


• Kinases 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 
WHOSE 
TRANSCRIPTS 
ARE CHANGED 


PHYSIOLOGICAL 
CONSEQUENCES 
OF MODIFYING 
GENE PRODUCT 
LEVELS 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES OF 
GENE PRODUCTS 
WITH MODIFIED 
LEVELS 






• Secondary 
metabolites 
produced 


Phosphatases and 
signal 

transduction 
protein 
• Chromatin 
structure 
modulators 


Down regulated 
genes 


Genes involved in 
negative regulation 
of root, stem and leaf 
genes 

Genes involved in 
other organs e.g. 
flowers 


• Leaf genes 
activated and leaf 
functions induced 

• Other organs not 
induced 

• Leaf, stem and 
root metabolic 
pathways induced 


• Transcription 
factors 

• Signal 
transduction 
proteins - kinases 
and phosphatases 

• Metabolic 
enzymes 

• Chromatin 
remodeling 
proteins 



While early seedling phase polynucleotides and gene products are used singly, 
combinations of these polynucleotides are often better to optimize new growth and 
development patterns. Useful combinations include different leaf polynucleotides and/or 
gene products with a hormone responsive polynucleotide. These combinations are useful 
because of the interactions that exist between hormone-regulated pathways, nutritional 
pathways and development. 

Use of Early Seedling Phase Gene Promoters 

Promoters of early seedling phase genes are useful for transcription of desired 
polynucleotides, both plant and non-plant. If the gene is expressed only in the post- 
germination seedling, or in certain kinds of leaf cells, the promoter is used to drive the 
synthesis of proteins specifically in those cells. For example, extra copies of carbohydrate 
transporter cDNAs operably linked to a early seedling phase gene promoter and inserted into 
a plant increase the "sink" strength of leaves. Similarly, early seedling phase promoters are 
used to drive transcription of metabolic enzymes that alter the oil, starch, protein, or fiber 
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contents of the seedling. Alternatively, the promoters direct expression of non-plant genes 
that can, for instance, confer resistance to specific pathogen. Additionally the promoters are 
used to synthesize an antisense mRNA copy of a gene to inactivate the normal gene 
expression into protein. The promoters are used to drive synthesis of sense RNAs to 
inactivate protein production via RNA interference. 

I.D.5. VEGETATIVE-PHASE SPECIFIC RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress 
conditions, including water loss. To combat such conditions, plant cells deploy a battery of 
responses that are controlled by a phase shift, from so called juvenile to adult. These changes 
at distinct times involve, for example, cotyledons and leaves, guard cells in stomata, and 
biochemical activities involved with sugar and nitrogen metabolism. These responses depend 
on the functioning of an internal clock, that becomes entrained to plant development, and a 
series of downstream signaling events leading to transcription-independent and transcription- 
dependent stress responses. These responses involve changes in gene expression. 

Manipulation of the activation of one or more genes controlling the phase changes are 
useful to modulate the biological processes and/or phenotypes listed below. Phase responsive 
genes and gene products can act alone or in combination. Useful combinations include phase 
responsive genes and/or gene products with similar transcription profiles, similar biological 
activities, or members of the same or functionally related biochemical pathways. Whole 
pathways or segments of pathways are controlled by transcription factor proteins and proteins 
controlling the activity of signal transduction pathways. Therefore, manipulation of such 
protein levels is especially useful for altering phenotypes and biochemical activities of plants. 

Phase responsive genes and gene products can function to either increase or dampen 
the above phenotypes or activities. Characterization of phase responsive genes was carried 
out using microarray technology. Microarray technology allows monitoring of gene 
expression levels for thousands of genes in a single experiment. This is achieved by 
hybridizing labeled fluorescent cDNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The US Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing about 10,000 non-redundant ESTs, selected from 
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about 37,000 randomly sequenced ESTs generated from mRNA of different tissues and 
developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in a mutant of 
Arabidopsis thaliana, squint, that appears not to undergo phase changes and appears adult- 
like throughout its growth cycle, compared with wild type were identified, compared to the 
Ceres full length cDNA and genomic sequence databanks, and equivalent Ceres clones 
identified. The MA_diff tables reports the results of this analysis, indicating those Ceres 
clones which are up or down regulated over controls, thereby indicating the Ceres clones 
which represent phase responsive genes. The MA_diff Table(s) reports the transcript levels of 
the experiment (see EXPT ID: Sqn (relating to SMD 7133, SMD 7137)). For transcripts that 
had higher levels in the samples than the control, a "+" is shown. A "-" is shown for when 
transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

Phase responsive genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 

Phase Responsive Genes Identified By Cluster Analyses Of Differential Expression 

Phase Responsive Genes Identified By Correlation To Genes That Are Differentially 
Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of phase responsive genes is any group in the MA_clust that 
comprises a cDNA ED that also appears in Expt ID Sqn (relating to SMD 7133, SMD 7137) 
of the MA_diff table(s). 

Phase Responsive Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of phase responsive genes. A group in the 
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MA_clust is considered a phase responsive pathway or network if the group comprises a 
cDNA ED that also appears in Knock-in or Knock-out tables that causes one or more of the 
phenotypes described in section above. 

Phase Responsive Genes Identified By Amino Acid Sequence Similarity 

Phase responsive genes from other plant species typically encode polypeptides 
that share amino acid similarity to the sequences encoded by corn and Arabidopsis phase 
responsive genes. Groups of phase responsive genes are identified in the Protein Grouping 
table. In this table, any protein group that comprises a peptide ID that corresponds to a 
cDNA ID member of a phase responsive pathway or network is a group of proteins that also 
exhibits Phase responsive functions/utilities. 

Further, promoters of phase responsive genes, as described in Reference tables, for 
example, are useful to modulate transcription that is induced by phase or any of the following 
phenotypes or biological activities below. Further, any desired sequence can be transcribed in 
similar temporal, tissue, or environmentally specific patterns as the phase responsive genes 
when the desired sequence is operably linked to a promoter of a phase responsive gene. 

a) USE OF PHASE RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Phase responsive genes and gene products are useful to or modulate one or more of 
the following phenotypes: 

Timing Phenotypes 

- Dormancy 

- Germination 

- Cotyledon opening 

- First leaves 

- Juvenile to adult transition 

- Bolting 

- Flowering 

- Pollination 

- Fertilization 

- Seed development 
Seed set 
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- Fruit Drop 
Senescence 

- Epinasty 

- Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation 

- Number of Flowers, Seeds, Branches, and/or Leaves 

- Seed Yield, including Number, Size, Weight, and/or Harvest Index 

- Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

- Plant Development 

- Time to Fruit Maturity 

- Cell Wall Strengthening and Reinforcement 

- Stress Tolerance 

- Drought tolerance 

. - Flooding tolerance 

- UV tolerance 

To regulate any of the phenotype(s) above, activities of one or more of the phase 
responsive genes or gene products can be modulated and the plants can be tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be screened for 
variants as in Anderson et al. (1997) Plant Cell 9: 1727-1743; Heintzen et al. (1997) Proc. 
Natl. Acad. Sci. USA 94: 8515-20; Schaffer et al. (1998) Cell 93:1219-1229; Somers et al. 
(1998) Development 125: 485-494; Somers et al. (1998) Science 282: 1488-1490; Wang and 
Tobin (1998) Cell 93: 1207-1217; Zhong et al. (1998) Plant Cell 10: 2005-2017; Sugano et 
al. (1998) Proc. Natl Acad. Sci. USA 95: 11020-11025; Dowson-Day and Millar (1999) 
Plant J\l\ 63-71; Green and Tobin (1999) Proc. Natl Acad. Sci. USA 96: 4176-419; Staiger 
and Apel (1999) Mol Gen. Genet. 261: 811-819; Strayer and Kay (1999) Curr. Opin. Plant 
Biol 2:1 14-120; Strayer et. al. (2000) Science 289:768-771; Kreps et al. (2000) J Biol 
Rhythms (2000) 15:208-217; Nelson et al. (2000) Cell 101:331-340; Somers et al. (2000) Cell 
101:319-329. 



USE OF PHASE RESPONSIVE GENES TO MODULATE 



BIOCHEMICAL ACTIVITIES 
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he activities of one or more of the phase responsive genes can be modulated to change 
biochemical or metabolic activities and/or pathways such as those noted below. Such biological 
activities are documented and can be measured according to the citations above and included in 
the table below: 



Process 


Biochemical Or Metabolic 
Activities And/Or Pathways 


Citations including assays 


Germination And 

Seedling 

Development 


Cold, Light And Water 
Modulated Signal Transduction 
Pathways, Receptors, Kinases, 
PAS Domain Proteins 


Bognar et al. (1999; Proc. Natl Acad. 
Set USA 96:14652-14657; Suganoet 
al (1999) Proc, Natl. Acad. Set USA 
96:12362-12366; Dowson-Day and 
Millar (1999) Plant J 11: 63-71; 
Somers et al. (2000) Cell 101:319- 
329; Zhong et al. (1998) Plant Cell 
10: 2005-2017 


Growth 

Transitions And 
Flowering 


Cold And Light Modulated 
Signal Transduction Pathways, 
Receptors, Kinases, PAS 
Domain Protiens 


Nelson et al. (2000) Cell 101:331-340; 
Fowler et al. (1999) EMBOJ. 
18:4679-4688 


Tuber Formation 


Cold And Light Modulated 
Signal Transduction Pathways 


Yanovsky et al. (2000) Plant 1 23: 
223-232 


METABOLISM 






Lipid Metabolism 


Membrane Lipid Synthesis 
Including Omega-3 Fatty Acid 
Desaturase, Lipases, Lipid 
Transfer Proteins 


Bradley and Reddy (1997) 1 
BacterioL 179: 4407-4410; Martin, M 
et al. 1999 Europe J. Biochem 262: 
283-290 
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Process 


Biochemical Or Metabolic 
Activities And/Or Pathways 


Citations including assays 


Sugar 

Metabolism 


Glycosylhydrolases, 
Glycosyltransferases, 
Amylases, Sucrose Synthase, 
CAB, Rubisco, Light Signal 
Transduction 


Liu et al. (1996) Plant Physiol. 
1 12:43-51; Millar and Kay (1996) 
Proc Natl Acad Sci USA 93:15491- 
15496; Wang et al. (1997) Plant Cell 
9:491-507; Shinohara et al (1999) J. 
Biol. Chem. 273: 446-452 


Nitrogen 
Metabolism 


Aminotransferases, Arginase, 
Proteases And Vegetative 
Storage Proteins, Aromatic 
Amino Acid Synthesis 


Bradley and Reddy (1997) J. 
Bacteriol. 179: 4407-4410 


Photorespiration 


Mitochondrial, Chloroplast And 
Peroxisomal Photorespiratory 
Enzymes, Serine 
Hydroxymethyl Transferases, 
Catalase 


Zhong and McClung (1996) Mol. Gen. 
Genet. 251:196-203; McClung (1997) 
Free. Radio. Biol. Med. 23:489-496; 
McClung et al. (2000) Plant Physiol. 
123:381-392 


Responses To 

Environmental 

Stress 


Expression Of Genes Involved 
In Responses To Drought, Salt, 
UV 


McClung (1997) Free Radio Biol Med 
23:489-496; Shi et al. (2000) Proc. 
Natl. Acad. Sci. USA 97:6896-6901 



Other biological activities that can be modulated by the phase responsive genes and 
their products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Phase responsive genes are characteristically differentially transcribed in response to 
maturity of the cell, organ or tissue which depends on a timing mechanism, which is internal 
to an organism or cell. The Intensity Table reports the changes in transcript levels of various 
phase responsive genes in a plant. 

The data from this experiment reveal a number of types of phase responsive genes and 
gene products. Profiles of some classes of phase responsive genes are shown in the table 
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below with examples of which associated biological activities are modulated when the 
activities of one or more such genes vary in plants. 



Transcript 


Type Of Genes 


Physiological 


Examples Of 


Levels 




Consequences 


Biochemical Activity 


Up Regulated 


Responders To 


• Adult phase 




Transcripts 


mutation that confers 


adoption 


• Metabolic Enzymes 




adult like phase 


• Metabolisms 


• Change In Cell 






Affected By phase 


Membrane Structure 






change 


And Potential 




Genes induced in 


• Synthesis Of 


• Kinases And 




adult-like phase 


Secondary 


Phosphatases 






Metabolites 


• Transcription 






And/Or Proteins 


Activators 






• Modulation Of 


• Change In Chromatin 






Phase Response 


Structure And/Or 






Transduction 


Localized DNA 






Pathways 


Topology 






Specific Gene 








Transcription 








Initiation 




Down- 


Responders To 


• Negative 


• Transcription Factors 


Regulated 


mutation that confers 


Regulation of 


• Change In Protein 


Transcripts 


adult phase 


adult phase 


Structure By 






pathways 


Phosphorylation 




Genes repressed in 




(Kinases) Or 




adult-like phase 


• Changes In 


Dephosphoryaltion 






Pathways And 


(Phosphatases) 




Genes With 


Processes 


• Change In Chromatin 




Discontinued 


Operating In Cells 


Structure And/Or 




Expression Or 




DNA Topology 




Unstable mRNA in 


Changes In 


• Stability Factors For 




adult-like phase 


Metabolic 


Protein Synthesis And 






pathways other 


Degradation 






than phase 


• Metabolic Enzymes 






specific pathways 





Use of Promoters of Phase Responsive Genes 

Promoters of phase responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the phase responsive 
genes where the desired sequence is operably linked to a promoter of a phase responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
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in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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H. HORMONE RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

H.A. ABSCISSIC ACID RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plant hormones are naturally occurring substances, effective in very small amounts, 
which act as signals to stimulate or inhibit growth or regulate developmental processes in 
plants. Abscisic acid (ABA) is a ubiquitous hormone in vascular plants that has been 
detected in every major organ or living tissue from the root to the apical bud. The major 
physiological responses affected by ABA are dormancy, stress stomatal closure, water 
uptake, abscission and senescence. In contrast to auxins, cytokinins and gibberellins, which 
are principally growth promoters, ABA primarily acts as an inhibitor of growth and metabolic 
processes. 

Changes in ABA concentration internally or in the surrounding environment in 
contact with a plant results in modulation of many genes and gene products. Examples of 
such ABA responsive genes and gene products are shown in the Reference, Sequence, 
Protein Group, Protein Group Matrix tables, MA_diff, and MA_clust tables. These genes 
and/or products are responsible for effects on traits such as plant vigor and seed yield. They 
were discovered and characterized from a much larger set of genes by experiments designed 
to find genes whose mRNA products changed in concentration in response to application of 
ABA to plants. 

While ABA responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different ABA responsive polynucleotides and/or gene products that 
have similar transcription profiles or similar biological activities, and members of the same or 
similar biochemical pathways. Whole pathways or segments of pathways are controlled by 
transcription factor proteins and proteins controlling the activity of signal transduction 
pathways. Therefore, manipulation of such protein levels is especially useful for altering 
phenotypes and biochemical activities of plants. In addition, the combination of an ABA 
responsive polynucleotide and/or gene product with another environmentally responsive 
polynucleotide is also useful because of the interactions that exist between hormone regulated 
pathways, stress and defence induced pathways, nutritional pathways and development. 
Here, in addition to polynucleotides having similar transcription profiles and/or biological 
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activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in common or overlapping pathways. 

Such ABA responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in ABA 
5 concentration or in the absence of ABA fluctuations. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: 108560, 108561, 108513, 108597). For 
transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is 
shown for when transcript levels were reduced in root tips as compared to the control. For 
more experimental detail see the Example section below. 
10 ABA genes are those sequences that showed differential expression as compared to 

controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

ABA Genes Identified By Cluster Analyses Of Differential Expression 
O ABA Genes Identified By Correlation To Genes That Are Differentially 

Jb> Expressed 

E As described above, the transcription profiles of genes that act together are 

ru well correlated. Applicants not only have identified the genes that are differentially 

fji expressed in the microarray experiments, but also have identified the genes that act in concert 

JL with them. The MA_clust table indicates groups of genes that have well correlated 

W0 transcription profiles and therefore participate in the same pathway or network. 

rJ 

A pathway or network of ABA genes is any group in the MA_clust that 
p; comprises a cDNA ID that also appears in Expt ID 108560, 108561, 108513, 108597 of the 
MA_difftable(s). 

25 ABA Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of ABA genes. A group in the MA_clust is 
considered a ABA pathway or network if the group comprises a cDNA ID that also appears 
30 in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 
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ABA Genes Identified By Amino Acid Sequence Similarity 

ABA genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis ABA genes. Groups 
of ABA genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a ABA pathway or 
network is a group of proteins that also exhibits ABA functions/utilities. 

Further, promoters of ABA responsive genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by ABA or any of the following 
phenotypes or biological activities below. 

HA.1. USE OF ABSCISSIC ACID RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

ABA responsive genes and gene products are useful to or modulate one or more of the 
following phenotypes: 

- Development 

- Cell Growth 

- Promotion Of Leaf Cell Elongation 

- Fruit Development 

Fruit Drop 

- Inhibition Of Parthenocarpy And Ovary Growth 

- Seed Development 

- Maturation Of Zygotic And Somatic Embryos 

- Embryo Development 

Seed Development And Maturation 

- Acquisition Of Desiccation Tolerance 

- Dormancy 

- Control Rate And Timing Of Germination 

- Prolongation Of Seed Storage And Viability 

- Inhibition Of Hydro lytic Enzyme Synthesis 

- Growth 

- Roots 
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- Inhibition Of Root Elongation Under Low Water Potential 

- Stems 

- Buds 

- Promotion Of Dormancy 

- Lateral/Axillary Bud Formation 

- Leaves 

- Inhibition Of ABA-Induced Growth And Elongation 

- Biomass 

- Fresh And Dry Weight During Any Time In Plant Life, Such As 
Maturation; 

- Number, Size, And Weight Of 

Flowers; 

- Seeds; 

- Senescence 

- Abscission 

- LeafFall 

- Flower Longevity 

- Differentiation 

- Plastid/Chloroplast Differentiation 

- Regulation Of Sterility 

- Stress Responses 

- Mediation Of Response To Desiccation, Drought, Salt And Cold 



To regulate any of the phenotype(s) above, activities of one or more of the ABA 
responsive genes or gene products can be modulated in an organism and tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to Koorneef and Karssen (1994, Seed dormancy and 
germination, In: Arabidopsis, Cold Spring harbor Lab. Press, pp 314-334), Cramer et al 
(1998, J. Exptl. Botany 49:191-198), and White and Rivin (2000, Plant Physiol 122: 1089- 
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97). Phillips et al. (1997) EMBO J 16: 4489-96; Nambara et al (1995) Development 121: 
629-636; Hays et al (1999) Plant Physiol. 1 19: 1065-72; Filonova et al (2000) J Exptl Botany 
51: 249-64; White et al (2000) Plant Physiol. 122: 1081-88; and Visser et al. (1998) Plant 
Mol Biol 37: 131-40; Rohde et al. (2000) Plant Cell 12:35-52; and Cramer et al. (1998) J. 
experimental Botany. 49: 191-198. 

n.A.2. USE OF ABSCISSIC ACID RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 



The activities of one or more of the ABA responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 





BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING ASSAYS 




AND/OR PATHWAYS 




Growth , 


• Farnesylation 


Pei Et Al (1998) Science 282: 287- 


Differentiation And 




290; Cutler Et Al. (1996) Science 


Development 




273: 1239 




• Nitrogen Metabolism 


Goupil Et Al (1998) J Exptl Botany 






49:1855-62 


Water Conservation 


• Stomatal Development 


Allen Et Al. (1999) Plant Cell 1 1 : 


And Resistance To 


And Physiology 


1785-1798 


Drought And Other 




Li Et Al. 2000 Science 287: 300-303 


Related Stresses 




Burnett Et Al 2000. J. Exptl Botany 






51: 197-205 






Raschke (1987) In: Stomatal 






Function Zeiger Et Al. Eds., 253-279 




• Stress Response Pathways 


Bush And Pages (1998) Plant Mol. 






Biol. 37: 425-35 




• Inhibition Of Ethylene 


Spollen Et Al (2000) Plant Physiol. 




Production Under Low 


122:967-976 




Water Potential 






• Proline And Other 


Hare Et Al. (1998) Plant, Cell And 




Osmolite Synthesis And 


Environment 21:535-553; Hare Et Al. 
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BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING ASSAYS 




AND/OR PATHWAYS 






Degradation 


(1999) J. Exptl. Botany 50:413-434 




• Plasmalemma And 


Macrobbie (1998) Philos Trans R Soc 




Tonoplast Ion Channel 


Lond B Biol Sci 353: 1475-88; Li Et 




Changes 


Al (2000) Science 287:300-303; 






Barkla Et Al. (1999) Plant Physiol. 






120:811-819 




• Ca2+ Accumulation 


Lacombe Et Al. (2000) Plant Cell 12: 






837-51; Wang EtAl. (1998) Plant 






Physiol 1 18:1421-1429; Shi Et Al. 






(1999) Plant Cell 11: 2393-2406 




. K+ Efflux 


Gaymard Et Al. (1998) Cell 94:647- 






655 




• Activation Of Kinases 


Jonak Et Al. (1996) Proc. Natl. Acad. 




And Phosphatases 


Sci 93: 11274-79; Sheen (1998) Proc. 






Natl. Acad. Sci. 95: 975-80; Allen Et 






Al. (1999) Plant Cell 11: 1785-98 



Other biological activities that can be modulated by the ABA responsive genes and 
gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

ABA responsive genes are characteristically differentially transcribed in response to 
fluctuating ABA levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff reports the changes in transcript levels of various ABA responsive genes in 
entire seedlings at 1 and 6 hours after a plant was sprayed with a Hoagland's solution 
enriched with ABA as compared to seedlings sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of ABA 
responsive genes and gene products, including "early responders," and "delayed ABA 
responders", " early responder repressors" and " delayed repressors". Profiles of these 
different ABA responsive genes are shown in the Table below together with examples of the 
kinds of associated biological activities. 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up Regulated 


Early Responders 


ABA Perception 


Transcription Factors 


Transcripts 


To ABA 


ABA Uptake 


Transporters 


(Level At 1 Hr=6 






Change In Cell Membrane 


Hr)or 






Structure 


(LevelAtlHr>6 




Modulation Of ABA 


Kinases And Phosphatases 


Hr) 




Response 








Transduction 








Pathways 








Specific Gene 


Transcription Activators 






Transcription 


Change In Chromatin 






Initiation 


Structure And/Or 








Localized DNA Topology 


Up Regulated 


Delayed 


Maintenance Of 


Transcription Factors 


Transcripts 


Responders 


Response To ABA 


Specific Factors (Initiation 


(Level At 1 Hr < 6 




Maintenance Of Seed 


And Elongation) For 


Hr) 




Dormancy, Stress 


Protein Synthesis 






Stomatal Closure, 


Maintenance Of Mrna 






Water Uptake 


Stability 






Control, Abscission 


Maintenance Of Protein 






And Senescence 


Stability 






Control Pathways 


Maintenance Of Protein- 








Protein Interaction 


Down-Regulated 


Early Responder 


Negative Regulation 


Transcription Factors 


Transcripts 


Repressors Of 


Of ABA Pathways 


Change In Protein 


(Level AtlHr = 6 


ABA State Of 


Released 


Structure By 


Hr)or 


Metabolism 




Phosphorylation (Kinases) 


(LevelAt6Hr>l 




Changes In Pathways 


Or Dephosphoryaltion 


Hr) 


Genes With 


And Processes 


(Phosphatases) 




Discontinued 


Operating In Cells 


Change In Chromatin 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 




Expression Or 




Structure And/Or DNA 




UnsTable mRNA 




Topology 




In Presence Of 








ABA 






Down-Regulated 


Delayed 


Negative Regulation 


Transcription Factors 


Transcripts 


Repressors Of 


Of ABA Pathways 


Kinases And Phosphatases 


(LevelAtlHr>6 


ABA State Of 


Released 


Stability Of Factors For 


Hr) 


Metabolism 




Protein Synthesis And 






Maintenance Of 


Degradation 




Genes With 


Pathways Released 






Discontinued 


From Repression 






Expression Or 








UnsTable mRNA 


Changes In Pathways 






In Presence Of 


And Processes 






ABA 


Operating In Cells 





Use of Promoters of ABA responsive Genes 

Promoters of ABA responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
a similar temporal, tissue, or environmentally specific patterns as the ABA responsive genes 
where the desired sequence is operably linked to a promoter of a ABA responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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H.B. AUXIN RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

Plant hormones are naturally occurring substances, effective in very small amounts 
that stimulate or inhibit growth or regulate developmental processes in plants. One of the 
plant hormones is indole-3 -acetic acid (IAA), often referred to as Auxin. 

Changes in Auxin concentration in the surrounding environment in contact with a 
plant or in a plant results in modulation of the activities of many genes and hence levels of 
gene products. Examples of such Auxin responsive genes and their products are shown in the 
Reference and Sequence Tables. These genes and/or products are responsible for effects on 
traits such as plant vigor and seed yield. The genes were discovered and characterized from a 
much larger set by experiments designed to find genes whose mRNA products changed in 
response to application of Auxin to plants. 

Manipulation of one or more Auxin responsive gene activities are useful to modulate 
the biological activities and/or phenotypes listed below. Auxin response genes and gene 
products can act alone or in combination. Useful combinations include Auxin response genes 
and/or gene products with similar transcription profiles, similar biological activities, or 
members of the same or functionally related biochemical pathways. Whole pathways or 
segments of pathways are controlled by transcription factor proteins and proteins controlling 
the activity of signal transduction pathways. Therefore, manipulation of the levels of such 
proteins is especially useful for altering phenotypes and biochemical activities of plants. The 
MA__diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 108564, 
108565, 108516, 108554, 108466, 107886, 107891, SMD 3743, andNAA (relating to SMD 
3749, SMD 6338, SMD 6339)). For transcripts that had higher levels in the samples than the 
control, a is shown. A "-" is shown for when transcript levels were reduced in root tips 
as compared to the control. For more experimental detail see the Example section below. 

NAA genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

NAA Genes Identified By Cluster Analyses Of Differential Expression 



As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 



NAA Genes Identified By Correlation To Genes That Are Differentially 



Expressed 
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The MA clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of NAA genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108564, 108565, 108516, 108554, 108466, 107886, 
107891, SMD 3743, and NAA (relating to SMD 3749, SMD 6338, SMD 6339) of the 
MA_difftable(s). 



Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of NAA genes. A group in the MA_clust is 
considered a NAA pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

NAA Genes Identified By Amino Acid Sequence Similarity 

NAA genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis NAA genes. Groups of 
NAA genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a NAA pathway or 
network is a group of proteins that also exhibits NAA functions/utilities. 

Such Auxin responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in Auxin 
concentration or in the absence of Auxin fluctuations. Further, promoters of Auxin responsive 
genes, as described in the Reference tables, for example, are useful to modulate transcription 
that is induced by auxin or any of the following phenotypes or biological activities below. 



NAA Genes Identified By Correlation To Genes That Cause Physiological 



n.B.l. USE OF AUXIN RESPONSIVE GENES, GENE COMPONENTS 



AND PRODUCTS TO MODULATE PHENOTYPES 



Auxin Responsive Genes And Gene Products Are Useful To Or Modulate One 
Or More Of The Following Phenotypes: 
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• Growth 

• Apical Dominance 

• Vascular Growth 

• Roots 

• Inhibition Of Primary Root Elongation 

• Increased Lateral Root Formation 

• Stems 

• Lateral Buds 

• Lateral Branching 

• Reduction Of Branching 

• For High Density Growth Per Acre 

• For Increased Wood Production 

• Lateral Organ Initiation And/Or Positioning In Apical Meristem, 

• Organ Formation, For Example, Fruit Number In Tomatoes 

• Leaves 

• Height/Stature, E.G., Taller Crops Or Increase Wood Production 

• Regeneration And Differentiation Of Cultured Cells Or Plantlets 

• Biomass 

• Fresh And Dry Weight During Any Time In Plant Life, Such As 
Maturation; 

• Number Of Flowers; 

• Number Of Seeds; 

• Number Of Branches; 

• Number Of Leaves; 

• Starch Content 

• Seed Yield, Including Number, Size, Weight, Harvest Index, Starch 
Content 

• Fruit Yield, Number, Size, Weight, Harvest Index, Starch Content 

• Development 

• Orienting Cell Growth, 

• Establishment And Maintenance Of Plant Axis 

• Apical Dominance 

. Cell Plate Placement 
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• Polarised Growth, Initiation And/Or Development, Of Embryos 
Morphogenic Progression, E.G., From Early Radial To Late Axialized 
Torpedo Stages 

• Differentiation Of Cells Into Morphologically Different Cell Layers 

• Cotyledon Separation 

• Fruit Development 

• Abscission, Leading To Modulation Of Fruit Drop 

• Parthenocarpy, Seedless Crops Resulting From Lack Of Seed Set 

• Vascularization, E.G. Hypocotyl And Cotyledon Tissues 

• Genetic Control Of Vascular Patterning And Influences Its Maturation; 

• Specification Of The Sites Where Vascular Differentiation Will Occur; 

• Determination Of The Direction And Extent Of Vascular Tissue 
Formation 

. Maintenance Of The Continuity Of Vascular Development With Plant 
Growth 

• Tropic Responses 

• Gravitropic Responses, E.G. Affecting Roots And Shoots 

• Modulation Of Phototropic Sensitivity, E.G. Increase Growth Under A 
Reduced Light Spectrum 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
environmentally specific patterns as the Auxin responsive genes when the desired 
sequence is operably linked to a promoter of an Auxin responsive gene. 

To modulate any of the phenotype(s) above, activities of one or more of the 
Auxin response genes or gene products can be modulated and the plants can be tested 
by screening for the desired trait. Specifically, the gene, mRNA levels, or protein 
levels can be altered in a plant utilizing the procedures described herein and the 
phenotypes can be screened for variants as in Winkler et al. (1998) Plant Physiol 118: 
743-50 and assayed, for example, in accordance with Bechtold and Pelletier (1998). 
Methods Mol. Biol. 82: 259-266; Clough and Bent (1998). 16: 735-743; Krysan et 
al. (1999). Plant Cell 11:2283-2290. 
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HB.2. USE OF AUXIN RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO BIOCHEMICAL ACTIVITIES: 

The activities of one or more of the Auxin responsive genes can be modulated 
to change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities are documented and can be measured according to 
the citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Cell Growth and 
Differentiation 


Protein Ubiquitination 


Gray et al. (1999) Genes and 
Develop, 13:1678-1691 
Bechtold and Pelletier (1998). 
Methods. Mol. Biol. 82:259-266 




Cell Wall loosening and 
Expansion 


Catala et al. (2000). Plant Physiol. 
122:527-534. 

Cosgrove, D. (1993). New Phytol. 
124:1-23. 


Auxin/Cytokinin Ratio 


Changing Auxin and/or 
cytokinin synthesis and/or 
turnover 


Chen et al. (1988). Plant Physiol. 
86:822-825 

Tarn et al. (2000). Plant Physiol. 
123:589-595 

Bartel and Fink. (1995). Science 
268:1745-1748. 

Prinsen et al. (1995). Quantifying 
phytohormones in transformed 
plants. In: Methods in Molecular 
Biology. 44:245-262. 


Auxin Transport 


Channeling of polar Auxin 
Transport 


Reed et al. (1998). Plant Physiol. 

118:1369-1378. 

Estelle, M. (1998). Plant Cell 

10:1775-1778 
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BIOCHEMICAL OR 






PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING 






AND/OR PATHWAYS 


ASSAYS 






Auxin Efflux Between Cells 


Reed et al. (1998). Plant Physiol. 








118:1369-1378. 








Marchant et al. (1999). EMBO J. 








18:2066-2073. 






Auxin Influx In and Out of a 


Reed et al. (1998). Plant Physiol. 






Cell 


118:1369-1378. 








Marchant et al. (1999). EMBO J. 








18:2066-2073. 






Electogenic Proton Symport 


Young etal. (1999). Biochim 






of Auxin 


BiophysActa. 1415(2):306-22 




Signal Transduction 


K+ Accumulation 


Philippar et al. (1999). Proc.Natl. 


1 






Acad.Sci. 96:12186-12191 


u 




Permeability of Cell 


Marchant et al. (1999). EMBO J. 


-. tr 

Li : 




Membranes 


18:2066-2073. 


01 




Guanine-Nucleotide 


Steinmann et al. (1999). Science 






Exchange 


286:316-318. 


fy 






Peyroche et al. (1996). Nature 


?? 






384:479-481. 


M 




Protein Phosphorylation 


Christensen et al. (2000). Cell 








100:469-478. 








Hirt (2000). Proc. Natl. Acad Sci. 








97:2405-2407. 






Interaction with Ethylene 


Madlungetal. (1999). Plant 






mode of action 


Physiol. 120:897-906. 








Xu et al. (1998). Plant Physiol. 








118:867-874. 




Protein Turnover 


Localization of Polypeptides 


Grebe et al. (2000). Plant Cell. 






with the basal End of Cells 


12:343-356 
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Other biological activities that can be modulated to by the Auxin responsive 
genes and their products are listed in the Reference Tables. Assays for detecting such 
biological activities are described in the Domain section of the Reference Tables. 
Auxin responsive genes are characteristically differentially transcribed in response to 
fluctuating Auxin levels or concentrations, whether internal or external to an organism 
or cell. The MA_diff(s) report(s) the changes in transcript levels of various Auxin 
responsive genes in the aerial parts of a seedling at 1 and 6 hours after the seedling 
was sprayed with an solution enriched with Auxin as compared to aerial parts of a 
seedling sprayed with water. 

The data from this time course can be used to identify a number of types of 
Auxin responsive genes and gene products, including "early responders," and 
"delayed responders." Profiles of these different classes of Auxin responsive genes 
are shown in the Table below together with examples of the kinds of associated 
biological activities. 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVEL 


GENES 


CONSEOUENCES 


BIOCHEMICAL 








ArTTVTTY OF GFNF 










T T 1 A J 

Upregulated 


Early 


: : 

• Auxin perception 


• Transcription factors 


transcripts 


responders to 






(level at 1 


Auxin 


• Auxin 


• Transporters; channeling 


hr^6hours) 




Uptake/transport 


of polar Auxin transport 


(level at 1 hr>6 








hours) 




• Modulation of 


• Kinases and 




Auxin response 


phosphatases; protein 






transduction 


ubiqutination; guanine 






pathways 


nucelotide exchange; 








changing Auxin and/or 








cytokininin synthesis 








and/or turnover; 








interaction with ethylene 








mode of action 






• Initiating 


• Auxin metabolic 






transcription of 


pathways 






specific gene(s) 


• Change in chromatin 








structure and/or UNA 








topology 








• Transcriptional activators 








• Change in activity of 








protein-protein 








interactions 






• Modmcation ot cell 


• Cell wall and cell growth 






walls 


promoting pathways 






• Modification of cell 


• Change in activity of 






structures 


cytoskeletal proteins 








modulating cell structure 






• IVlOUlllCaLl Oil Ul 


• IVlCldUUIlL/ CilZtjrllXCo 






metabolism 


• Coordination and control 








of central carbon and 








Auxin metabolism 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVEL 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY OF GENE 








PRODUCTS 


Upregulated 


"Delayed" 


Completion and/or 


Transcription factors 


transcripts (level 


Responders 


Maintenance of 


Changes in membrane 


at 1 hr <6 hr) 




Auxin response 


protein, membrane channel 








and/or transporter protein 








activity 






• Initiating 








transcription of 


- Change in chromatin 






specific gene(s) 


structure and/or DNA 








topology 








- Transcriptional activators 








- Change in activity of 








protein-protein interactions 






• Modification of cell 








walls 


- Cell wall proteins 






• Modification of cell 


- Change(s) in activity of 






structures 


cytoskeletal proteins 








modulating cell structure 






• Modification of 


- Coordination and control of 






metabolism 


central carbon and Auxin 








metabolism 








-metabolic enzymes 



f ~i 

o 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVEL 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY OF GENE 








PRODUCTS 


Downregulated 


Early repressor 


Repression of 


• Transcription factors 


transcripts 


responders to 


Auxin induced 


• Changes in activity of 


(level at lhour = 


Auxin 


proteins released 


cytoskeletal proteins 


6 hours) 






modulating cell structure 


(level at lhour > 






• Changes in chromatin 


6 hours^ 






structure and/or DNA 


Genes for 




topology 




pathways 




• Changes in protein 




diminished in 


• Reorientation of 


structure and/or function 




presence of 


metabolism in 


by phosphorylation 




Auxin 


certain cells 


(kinases) and/or 








dephosphorylation 








(phosphatases) 








• Stability of factors for 








protein translation 








• Changes in cell 








membrane structure 








• Changes in chromatin 








and/or localized DNA 








topology 








• Changes in protein- 








protein interaction 








• Metabolic enzymes 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVEL 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY OF GENE 








PRODUCTS 


Down-regulated 


"Delayed" 


• Maintenance of 


• Transcription factors 


transcripts 


repressor 


Auxin stimulated 


• Change in activity of 


(level at 1 hour < 


responders to 


state(s) in certain 


cytoskeletal proteins 


6 hours) 


Auxin 


cells 


modulating cell structure 








• Changes in chromatin 








structure and/or DNA 








topology 




Genes for 


• Reorientation of 


• Changes in protein 




pathways 


metabolism in 


structure and/or function 




diminished in 


certain cells 


by phosphorylation 




presence of 




(kinases) and/or 




Auxin 




dephosphorylation 








(phosphatases) 








• Stability of factors for 








protein translation 








• Changes in cell 








membrane structure 








• Changes in chromatin 








and/or localized DNA 








topology 








• Changes in protein- 








protein interaction 








• Metabolic enzymes 



r 8 * 

Use of Promoters of NAA responsive Genes 
4* Promoters of NAA responsive genes are useful for transcription of any desired 

\& polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed in 
5 a similar temporal, tissue, or environmentally specific patterns as the NAA responsive genes 
where the desired sequence is operably linked to a promoter of a NAA responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response to 
the same stimuli as the protein product of the gene from which the promoter was derived. Such 
promoter are also useful to produce antisense mRNAs to down-regulate the product of proteins, 
10 or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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n.C. BRASSINOSTEROID RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS: 

Plant hormones are naturally occuring substances, effective in very small amounts, 
which act as signals to stimulate or inhibit growth or regulate developmental processes in 
plants. Brassinosteroids (BRs) are the most recently discovered, and least studied, class of 
plant hormones. The major physiological response affected by BRs is the longitudinal growth 
of young tissue via cell elongation and possibly cell division. Consequently, disruptions in 
BR metabolism, perception and activity frequently result in a dwarf phenotype. In addition, 
because BRs are derived from the sterol metabolic pathway, any perturbations to the sterol 
pathway can affect the BR pathway. In the same way, perturbations in the BR pathway can 
have effects on the later part of the sterol pathway and thus the sterol composition of 
membranes. 

Changes in BR concentration in the surrounding environment or in contact with a 
plant result in modulation of many genes and gene products. Examples of such BR 
responsive genes and gene products are shown in the Reference and Sequence Tables. These 
genes and/or products are responsible for effects on traits such as plant biomass and seed 
yield. These genes were discovered and characterized from a much larger set of genes by 
experiments designed to find genes whose mRNA abundance changed in response to 
application of BRs to plants. 

While BR responsive polynucleotides and gene products can act alone, combinations 
of these polynucleotides also affect growth and development. Useful combinations include 
different BR responsive polynucleotides and/or gene products that have similar transcription 
profiles or similar biological activities, and members of the same or functionally related 
biochemical pathways. Whole pathways or segments of pathways are controlled by 
transcription factors and proteins controlling the activity of signal transduction pathways. 
Therefore, manipulation of such protein levels is especially useful for altering phenotypes 
and biochemical activities of plants. In addition, the combination of a BR responsive 
polynucleotide and/or gene product with another environmentally responsive polynucleotide 
is useful because of the interactions that exist between hormone regulated pathways, stress 
pathways, nutritional pathways and development. Here, in addition to polynucleotides 
having similar transcription profiles and/or biological activities, useful combinations include 
polynucleotides that may have different transcription profiles but which participate in 
common or overlapping pathways. The MA_diff Table(s) reports the transcript levels of the 
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experiment (see EXPT ID: 108580, 108581, 108557, 108478, 108479, 108480, 108481). For 
transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is 
shown for when transcript levels were reduced in root tips as compared to the control. For 
more experimental detail see the Example section below. 
5 BR genes are those sequences that showed differential expression as compared to 

controls, namely those sequences identified in the MA_diff tables with a "+" or indication 

BR Genes Identified By Cluster Analyses Of Differential Expression 

BR Genes Identified By Correlation To Genes That Are Differentially 

10 Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
O with them. The MA_clust table indicates groups of genes that have well correlated 
Jj> transcription profiles and therefore participate in the same pathway or network. 
% A pathway or network of BR genes is any group in the MA_clust that 

fU comprises a cDNA ID that also appears in Expt ID 108580, 108581, 108557, 108478, 
J! 108479, 108480, 108481 of the MA_diff table(s). 

OSO BR Genes Identified By Correlation To Genes That Cause Physiological 

n \ 

z jr Consequences 

P Additionally, the differential expression data and the phenotypic observations 

Li 

can be merged to identify pathways or networks of BR genes. A group in the MA_clust is 
considered a BR pathway or network if the group comprises a cDNA ID that also appears in 
25 Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

BR Genes Identified By Amino Acid Sequence Similarity 

BR genes from other plant species typically encode polypeptides that share 
30 amino acid similarity to the sequences encoded by corn and Arabidopsis BR genes. Groups 
of BR genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a BR pathway or network 
is a group of proteins that also exhibits BR functions/utilities. 
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Such BR responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in BR concentration 
or in the absence of BR fluctuations. Further, promoters of BR responsive genes, as 
described in the Reference tables, for example, are useful to modulate transcription that is 
induced by BR or any of the following phenotypes or biological activities below. 

nC.l. USE OF BRASSINOSTEROED RESPONSIVE GENES TO 

MODULATE PHENOTYPES 
Brassinosteroid responsive genes and gene products are useful to or modulate 
one or more of the following phenotypes: 

• Growth 

• Promotes Cell Elongation 

• Elongation Accelerated At Low Temperatures For Increased Plant Growth In 
Marginal Lands 

. Acts In Concert With Other Hormones To Promote Cell Division 

• Roots 

• Inhibitory To Root Growth 

• Expression In Roots Would Inhibit Bud Breaking Due To Higher 
AuximCytokinin Ratio In Epicotyl 

• Stems 

• Inhibits Radial Growth While Causing Stem Elongation 

• In Low Concentrations, Promotes Radial Expansion 

• Increases Biomass 
. Height 

• Seeds 

• Promotes Cell Expansion In Embryo And Thus Enhances Germination 

• Leaves 

• Increase Biomass 

• Flowers 

• Increase Reproduction 

• Biomass 
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• Fresh And Dry Weight During Any Time In Plant Life, Such As Maturation; 

• Number Of Flowers; 

• Number Of Seeds; 

• Number Of Branches; 

• Number Of Leaves; 

• Starch Content 

• Seed Yield, Including Number, Size, Weight, Harvest Index, Starch Content 

• Fruit Yield, Number, Size, Weight, Harvest Index, Starch Content 

• Development 

• Morphogenesis 

• Control Of Organ Size And Shape 

• Development Of New Ornamentals 

• Control Of Leaf Size And Shape 

• Promotes Leaf Unrolling And Enlargement 

• For Development Of New Leafy Ornamentals 

• Seed Development 

• Inhibition Of De-Etiolation 

• Dormancy 

• Accelerated Germination At Low Temperatures 
. Root 

• Gravitropism 

• Senescence 

• Promoted In Light Grown Plants 

• Inhibiting Synthesis Or Perception Could Extend Life Span Of Desired 
Tissues/Organs 

• Differentiation 

• Vascularization 

• Promotes Xylem Differentiation 

• Increases Xylem Fiber Length 

• Resistance Responses 

• Increases Resistance To Pathogens 

• Tropic Responses 
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Gravitropic Responses Affecting Roots 
5 Further, any desired sequence can be transcribed in similar temporal, tissue, or 

enviromentally specific patterns as the BR responsive genes when the desired 
sequence is operably linked to a promoter of a BR responsive gene. 

To improve any of the desired phenotype(s) above, activities of one or more of 
the BR response genes or gene products can be modulated and the plants tested by 
10 screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels 

can be altered in a plant utilizing the procedures described herein and the phenotypes 
can be assayed. As an example, a plant can be transformed according to Bechtold and 
Pelletier (1998, Methods. Mol. Biol. 82:259-266, and/or screened for variants as in 
O Winkler et al. (1998) Plant Physiol 118: 743-50, visually inspected for the desired 

=J;5 phenotype and metabolically and/or functionally assayed according to Choe et al. 

% (1999, Plant Cell 1 1:207-21 and Plant Physiol 119: 897-907), Yamamoto et al. (1997, 

fU Plant Cell Physiol 38:980-3), Asami and Yshida (1999, Trends in Plant Sciences, 

J 4:348-353) and Azpiroz et al. (1998, Plant Cell 10:219-230) 

i-0 n.C.2. USE OF BRASSINOSTEROID RESPONSIVE GENES TO 

% MODULATE BIOCHEMICAL ACTIVITIES 

H The activities of one or more of the BR responsive genes can be modulated to change 

biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities are documented and can be measured according to the citations included 
25 in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


BR Transport 


. BR Efflux Between Cells 


B.Schulz and K. Feldmann, 
unpub. results 




. BR Influx In And Out Of A 
Cell 


B.Schulz and K. Feldmann, 
unpub. results 


Signal Transduction 


. Permeability Of Cell 
Membranes 
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• Protein Phosphorylation 




Metabolism 


• Major Growth Coordinating 
Pathways 





Other biological activities that can be modulated by the BR responsive genes and 
gene products are listed in the Reference Tables. Assays for detecting such biological 
activities are described in the Domain section of the Reference Tables. 

BR responsive genes are differentially transcribed in response to fluctuating BR levels 
or concentrations, whether internal or external to an organism or cell. The MA_diff table(s) 
report(s) the changes in transcript levels of various BR responsive genes in the aerial parts of 
a seedling at 1 and 6 hours after a plant was sprayed with a solution enriched with BR as 
compared to seedlings sprayed with water. The data from this time course can be used to 
identify a number of types of BR responsive genes and gene products, including "early 
responders," "delayed responders." Profiles of these different categories of BR responsive 
genes are shown in the Table below together with examples of the kinds of associated 
biological activities. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up Regulated 


• Early 


• BR Perception 


• Transcription 


Transcripts 


Responders To 




Factors 


(Level At 1 Hr « 6 


BR 


• BR Transport 


• Receptors 


Hr) 






• Transporters 


(Level At 1 Hr>6 






• Change In Cell 


Hr) 






Membrane Structure 








• Feedback Regulated 






• BR Biosynthesis 


Biosynthetic Genes 






Feedback 










• Kinases And 








Phosphatases 






• Modulation Of 


. 2 nd Messengers, Eg., 






BR Response 


Calmodulin 






Transduction 








Pathways 


• Transcription 








Activators 






Specific Gene 


• Change In 






Transcription 


Chromatin Structure 






Initiation 


And/Or Localized 








DNA Topology 


Up Regulated 


• Delayed 


• Maintenance Of 


• Transcription 


Transcripts 


Responders 


Response To Br 


Factors 


(LevelAtlHr<6 






• BR Biosynthetic 


Hr) 






Genes 








• Specific Factors 








(Initiation And 



Page 288 of 772 



Attorney Ukt. 2730- 14831* 
Client Dkt. 80090.005 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 
ACTIVITY 








Elongation) For 








Protein Synthesis 








• Maintenance Of 








Mrna Stability 








• Maintenance Of 








Protein Stability 








• Maintenance Of 








Protein-Protein 








Interaction 






• Cell And Organ 


. Cell Wall 






Elongation 


Elongation 






Gravitropism 




Down-Regulated 


Early Responder 


• Negative 


• Transcription 


Transcripts 


Repressors Of BR 


Regulation Of 


Factors 


(Level At 1 Hr«6 


State Of Metabolism 


BR Pathways 


• Change In Protein 


Hr) 




Released 


Structure By 


(LevelAt6Hr>l 


Genes With 




Phosphorylation 


Hr) 


Discontinued 


Changes In 


(Kinases) Or 




Expression Or 


Pathways And 


Dephosphoryaltion 




UnsTable Mrna In 


Processes 


(Phosphatases) 




Presence Of 


Operating In 


• Change In 




BR 


Cells 


Chromatin Structure 
And/Or DNA 
Topology 


Down-Regulated 


• Delayed 


Negative 


• Transcription 


Transcripts 


Repressors Of 


Regulation Of 


Factors 


(Level AtlHr>6 


BR State Of 


BR Pathways 


• Kinases And 


Hr) 


Metabolism 


Released 


Phosphatases 
. Stability Of Factors 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 
ACTIVITY 




. Genes With 


• Maintenance Of 


For Protein 




Discontinued 


Pathways 


Synthesis And 




Expression Or 


Released From 


Degradation 




UnsTable Mrna 


Repression 






In Presence Of 








BR 


• Changes In 
Pathways And 
Processes 
Operating In 
Cells 





Use of Promoters of BR responsive Genes 
fU Promoters of BR responsive genes are useful for transcription of any desired 

c-i : 

m polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the BR responsive genes 

ffi where the desired sequence is operably linked to a promoter of a BR responsive gene. The 

£ protein product of such a polynucleotide is usually synthesized in the same cells, in response 
to the same stimuli as the protein product of the gene from which the promoter was derived. 
Such promoter are also useful to produce antisense mRNAs to down-regulate the product of 

10 proteins, or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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HP. CYTOKININ RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plant hormones are naturally occurring substances, effective in very small amounts, 
which act as signals to stimulate or inhibit growth or regulate developmental processes in 
plants. Cytokinins (BA) are a group of hormones that are best known for their stimulatory 
effect on cell division, although they also participate in many other processes and pathways. 
All naturally occurring B As are aminopurine derivatives, while nearly all synthetic 
compounds with BA activity are 6-substituted aminopurine derivatives. One of the most 
common synthetic BAs used in agriculture is benzylaminopurine (BAP). 

Changes in B A concentration in the surrounding environment or in contact with a 
plant results in modulation of many genes and gene products. Examples of such BA 
responsive genes and gene products are shown in the Reference, Sequence, Protein Group, 
Protein Group Matrix tables, MA_diff and MA_clust. These genes and/or products are 
responsible for effects on traits such as plant vigor and seed yield. They were discovered and 
characterized from a much larger set by experiments designed to find genes whose mRNA 
products changed in response to application of BA to plants. 

While cytokinin responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different BA responsive polynucleotides and/or gene products that have 
similar transcription profiles or similar biological activities, and members of the same or 
functionally related biochemical pathways. Whole pathways or segments of pathways are 
controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. In addition, the combination of a 
BA responsive polynucleotide and/or gene product with another environmentally responsive 
polynucleotide is also useful because of the interactions that exist between hormone regulated 
pathways, stress pathways, nutritional pathways and development. Here, in addition to 
polynucleotides having similar transcription profiles and/or biological activities, useful 
combinations include polynucleotides that may have different transcription profiles but which 
participate in common or overlapping pathways. The MA_diff Table(s) reports the transcript 
levels of the experiment (see EXPT ID: 108566, 108567, 108517). For transcripts that had 
higher levels in the samples than the control, a is shown. A "-" is shown for when 
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transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

BA genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

BA Genes Identified By Cluster Analyses Of Differential Expression 

BA Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of BA genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108566, 108567, 108517 of the MA_diff 
table(s). 

BA Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of BA genes. A group in the MA_clust is 
considered a BA pathway or network if the group comprises a cDNA ED that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

BA Genes Identified By Amino Acid Sequence Similarity 1 

BA genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis BA genes. Groups 
of BA genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a B A pathway or network 
is a group of proteins that also exhibits BA functions/utilities. 
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Such BA responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in BA concentration 
or in the absence of BA fluctuations. 

Further, promoters of BA responsive genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by BA or any of the following 
phenotypes or biological activities below. 



BA responsive genes and gene products are useful to or modulate one or more of the 
following phenotypes: 

• Growth 

• Roots 

• Inhibition Of Elongation Of Root 

• Stems 

• Inhibition Of Elongation Of Hypocotyl 

• Lateral Buds 

• Promotion Of Outgrowth 

• For Rapid Production Of Multiple Shoots As A Source For Grafting 

• Leaves 

• Development 

. Cell Growth 



• Expansion Of Cotyledon 

• Promotes Cell Enlargement 

• For Increased Yield From Leaf Crops 

• Chloroplast Development 

• Delayed Degradation Of Chloroplasts 

• For Increased Photosynthesis And Crop Yield 

• Cell Division 

• For Increased Micropropagation 



n.D.l. USE OF BA-RESPONSIVE GENES TO MODULATE 



PHENOTYPES 
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• Senescence 
• Delays 

• For Delayed Conversion From Photosynthesis To Salvage 
Programs In Leaves 
5 • For Increased Crop Yield 

• Differentiation 

• Regulation Of Morphogenesis 

For Manipulating Callus Growth And Shoot/Root Formation In Culture 

• Maintenance Of Shoot Meristem 

1 0 • Increased Usable Wood Production 

• Reduced Tiller Number 

• For Denser Crop Planting Regimes 

• Nutrient Metabolism 

O • Effects On Seed Size 

js • Effects On Rate Of Seed Set 

J; • For Increased Seed Yield 

RJ • Induction Of Ethylene Biosynthesis 

m • Control Of Fruit Ripening 

]L • Parthenocarpy 

[g0 • Control Of Sexual Reproduction 

~* : 

Si: 

• Production Of Seedless Fruits 

n 

To regulate any of the phenotype(s) above, activities of one or more of the BA 
responsive genes or gene products can be modulated and the plants tested by 

25 screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels 

can be altered in a plant utilizing the procedures described herein and the phenotypes 
can be assayed. As an example, a plant can be transformed according to Bechtold and 
Pelletier (1998, Methods. Mol. Biol. 82:259-266) and/or screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and visually inspected for the desired 

30 phenotype or molecularly or metabolically or functionally assayed according to 

Lohman et al (1994, Physil. Plant 92:322-328), Woolhouse (1983, In Agricultural 
Research- Strategies of Plant reproduction, Meudt, ed., 201-236), Medford et al. 
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(1989, Plant Cell 1: 403-13), Vogel et al. (1998, Genetics 149:417-27), Ehnes and 
Roitsch (1997, Plant J 1: 539-48), Rotino et al. (1997, Nat. Biotchnol. 15: 1398-1401). 



n.D.2. USE OF BA-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the BA responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included 
in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS 
INCLUDING ASSAYS 


Chloroplast Functioning 


• Photosynthesis 


Benkova et al (1999) Plant 
Physil 121:245-252 


Induction And Maintenance 
Of Cell Division 


• Cell Cycle Phase Transition 


Riou-KhamUchi et al. 
(1999) Science 283: 1541- 
44 


Senescence 


. Cell Death/ Apoptosis 


Lohman et al. (1994) 
Physiol Plant 92: 322-328 


Signal Transduction 


# Sensing Endogenous Stimuli 
To Trigger Growth And 
Shoot Formation 


Kakimoto (1996) Science 
274: 982-985 



Other biological activities that can be modulated by the BA responsive genes and 
gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Domain section above. 

BA responsive genes are characteristically differentially transcribed in response to 
fluctuating BA levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff table reports the changes in transcript levels of various BA responsive genes in 
the aerial parts of a seedling at 1 and 6 hours after a plant was sprayed with a Hoagland's 
solution enriched with BA as compared to seedlings sprayed with Hoagland's solution only. 
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The data from this time course can be used to identify a number of types of BA 
responsive genes and gene products, including "early responders," and "delayed responders." 
Profiles of these different BA responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENES 


ACTIVITY 


ACTIVITY 


Up Regulated 


• Early 


- BA Perception 


-Transcription Factors 


Transcripts 


Responders To 


-B A Uptake 


-Transporters 


(Level At lh=6h) 


BA 


-Modulation Of BA 


-Kinase, Receptor-Like 


Or 




Response 


Protein Kinase 


(Higher At lh 




Transduction 




Than 6h) 




Pathways 


-Ovule-Specific Homeotic 






-Specific Gene 


Protein, Secretory 






Transcription 


Pathway 






Initiation 


-Cell Division Control 








Protein, Cyclins, Cyclin- 






-Initiate And 


Dependent Protein 






Coordinate Cell 


Kinase (Cdpk), Cell 






Division 


Cycle Phosphatases, 








Mitosis-Specific 








Chromosome 








Segregation Protein, 








Mitotic Phosphoprotein, 








Dna Replication Proteins, 








Helicase Telomerase, 








Centromere Protein, 








tRNA Synthase 








-Senescence- Associated 








Protein, Bifiinctional 








Nuclease, Aba Pathway 








Genes, Ethylene Pathway 






-Regulation Of 


Genes, Proteases, 






Pathways To 


Nucleases, Pcd Genes 






Senescence 


-Calvin Cycle, 








Chlorophyll A/B Binding 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENES 


ACTIVITY 


ACTIVITY 








Protein (Cab), 








Transketolase, 








Lipoxygenase, 








Chloroplast Rna 






-Modulation Of 


Processing Protein, 






Chloroplast Gene 


Chloroplast Envelope 






Expression And 


Membrane Protein. 






Photosysthesis 


-Glutamate Synthase, 
Gogat, Asparagine 
Synthase, Catalase, 
Peroxidase 

-Heat Shock Proteins, 






-Modulation Of 


Gst 






Photorespiration And 


-Fatty Acid Elongase- 






Primary Nitrogen 


Like Protein, Very-Long- 






Assimilation In 


Chain Fatty Acid 






Leaves 


Condensing Enzyme, Coa 






Expression 


Synthase 






-Stress Response 


-Vicilin Storage Protein 
-Homeobox Domain 






-Wax Biosynthesis 


Proteins 
-Mutase, 

Phosphoglycerate Mutase 
-Pectate Lyase, Ethylene 
Pathway Genes 






-Nutrient Metabolism 








-Embryogenesis 








-Glycolysis, 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 

i 


BIOCHEMICAL 


LEVELS 




A /^T I \ /TTV 

AC 11 VI I Y 


A PTn/TTV 

AL1IV11 Y 






Gluconeogenesis 








-Ripening 




Upregulated 


B A Late 


-BA Responsive 


-Transfactors, Kinases, 


Transcripts 


Responders 


Pathways 


Phosphatases, LRR's, 


(Higher At 6h 






Dna Remodelling 


Than lh) 






Proteins, Cu-Binding 








Proteins 






-Cell Wall Extension 


-Expansins, Extensins, 








Proline Rich Proteins 






-Organogenesis 


-AP2 Domain Containing 








Proteins 






-Modulate Activation 


-Transfactors Interacting 






Of Disease Defense 


With Resistant Genes 






Genes 


-Glycin-Rich Proteins, 








Wall-Associated 






-Modulate Responses 


Receptor Kinase (Wak) 






To External Stimuli 


-Proline Oxidase 






-Osmotic Stress 








Tolerance 




Down-Regulated 


Repressors Of BA 


-Regulation Of 


-Transfactors (Such As 


Transcripts (Low 


Pathway 


Senescence-Related 


Zinc-Finger Type), 


At Both lh and 




Gene Expression 


Kinases, Phosphatases, 


6h) 






G-Proteins, LRR 








Proteins, DNA 








Remodeling Protein 








-Carbonyl Reductases 
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GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY OF 
GENES 


TYPE OF 

BIOLOGICAL ^ 
ACTIVITY 1 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 






-Regulation Of Genes 
Involved In 
Maintenance Of 
Apical Dominance. 


-Atpases 
-Oxygenase 
-Octaprenyltransferase 
-Auxin Pathway Genes 
-Auxin Binding Proteins 



Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the B A responsive genes when the desired sequence 
is operably linked to a promoter of a B A responsive gene. 
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HE. GIBBERELLIC ACID RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plant hormones are naturally occuring substances, effective in very small amounts, 
5 which act as signals to stimulate or inhibit growth or regulate developmental processes in 
plants. Gibberellic acid (GA) is a hormone in vascular plants that is synthesized in 
proplastids (giving rise to chloroplasts or leucoplasts) and vascular tissues. The major 
physiological responses affected by GA are seed germination, stem elongation, flower 
induction, anther development and seed and pericarp growth. GA is similar to auxins, 
10 cytokinins and gibberellins, in that they are principally growth promoters. 

Changes in GA concentration in the surrounding environment or in contact with a 
plant result in modulation of many genes and gene products. Examples of such GA 
responsive genes and gene products are shown in the Reference and Sequence Tables. These 

JEW- 

genes and/or products are responsible for effects on traits such as plant vigor and biomass and 
€5 seed yield. They were discovered and characterized from a much larger set of genes by 
p experiments designed to find genes whose mRNA products changed in concentration in 
{{J response to application of nitrogen to plants. 

EH While GA responsive polynucleotides and gene products can act alone, combinations 

p of these polynucleotides also affect growth and development. Useful combinations include 
SO different GA responsive polynucleotides and/or gene products that have similar transcription 
4? profiles or similar biological activities, and members of the same or similar biochemical 
y. pathways. Whole pathways and/or segments of pathways are controlled by transcription 
factors and proteins that affect the activity of signal transduction pathways. Therefore, 
manipulation of such protein levels is especially useful for altering phenotypes and 
25 biochemical activities of plants. In addition, the combination of a GA responsive 

polynucleotide and/or gene product with another environmentally responsive polynucleotide 
is also useful because of the interactions that exist between hormone regulated pathways, 
stress pathways, nutritional pathways and development. Here, in addition to polynucleotides 
having similar transcription profiles and/or biological activities, useful combinations include 
30 polynucleotides that may have different transcription profiles but which participate in 

common overlapping pathways. The MA_diff Table(s) reports the transcript levels of the 
experiment (see EXPT ID: 108562, 108563, 108519, 108520, 108521, 108484, 108485, 
108486). For transcripts that had higher levels in the samples than the control, a "+" is 
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shown. A "-" is shown for when transcript levels were reduced 5m)0t tips as compared to 
the control. For more experimental detail see the Example section below. 

GA genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 



GA Genes Identified By Cluster Analyses Of Differential Expression 

GA Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of GA genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108562, 108563, 108519, 108520, 
108521, 108484, 108485, 108486 of the MA_diff table(s). 

GA Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of GA genes. A group in the MA_clust is 
considered a GA pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 



GA Genes Identified By Amino Acid Sequence Similarity 

GA genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis GA genes. Groups 
of GA genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a GA pathway or network 
is a group of proteins that also exhibits GA functions/utilities. 
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Such GA res^nsive genes and gene products can functioiro either increase or 
dampen the above phenotypes or activities either in response to changes in GA concentration 
or in the absence of GA fluctuations. Further, promoters of GA responsive genes, as 
described in the Reference tables, for example, are useful to modulate transcription that is 
induced by GA or any of the following phenotypes or biological activities below. 



GA responsive genes and gene products are useful to or modulate one or more of the 
following phenotypes: 

• Growth 

• Promotes Root Growth 

• Promotes Cell Division 

• Promotes Stem Elongation 

• Secondary (Woody) Growth 

• Promotes Growth In Leaves 

• Biomass 

• Increase In Stem And Leaf Mass 

• Increase In Xylem Fiber Length And Biomass Production 

• Development 
. Cell Growth 

• Fruit Development 

• Seed Development 

• Dormancy, Breaks Dormancy In Seeds And Buds 

• Promotes Trichome Formation 

• Decrease Senescence 

• Regulation Of Ferility 

• Stress Responses 

• Flowering Time 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the GA responsive genes when the desired 
sequence is operably linked to a promoter of a GA responsive gene. 



E.E.L USE OF GA RESPONSIVE GENES TO MODULATE 



PHENOTYPES: 
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To regulate any of the phenotype(s) above, activitTSsbf one or more of the GA 
response genes or gene products can be modulated and tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in 
a plant utilizing the procedures described herein and the phenotypes can be assayed. 
As an example, a plant can be transformed according to Bechtold and Pelletier (1998, 
Methods. Mol. Biol. 82:259-266) and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Hedden and Proebsting (1999, 
Plant Physiol. 119:365-370), Hedden and Phillips (1999, Current Opinion in Plant 
Biotech. 11:130-137), Perazza et al (1998, Plant Physiol. 117:375-383), Kende and 
Zeevart (1997, Plant Cell 9: 1 197-1210) and van der Knaap et al. (2000, Plant Physiol. 
122:695-704). 

HE.2. USE OF GA-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES: 

The activities of one or more of the GA responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included 
in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Cell Growth and 
Differentiation 


Biosynthesis of Gas 

Cell wall loosening and cell 

expansion 

GA deactivation 

Major growth promoting 

metabolic pathways 


Hedden and Proebsting 
(1999, Plant Physiol. 
119:365-370) 
Cosgrove (1993, New 
Phytol. 124:1-23) 
Hedden and Proebsting 
(1999, Plant Physiol. 
119:365-370) 


Perception and Signal 
Transduction 


Receptors 


Koornneef and van der 
Veen (1980, TAG 58:257- 
263) 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Synthesis of transcriptional 
regulators 

Calcium and Calmodulin 


Bethke and Jones (1998, 
Curr. Opin. Plant Biol. 
1:440-446) 



Other biological activities that can be modulated by the GA responsive genes and 
gene products are listed in the Reference Tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

GA responsive genes are characteristically differentially transcribed in response to 
fluctuating GA levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff table(s) report(s) the changes in transcript levels of various GA responsive 
genes in entire seedlings at 1 and 6 hours after a plant was sprayed with a Hoagland's 
solution enriched with GA as compared to seedlings sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of GA 
responsive genes and gene products, including "early responders," and "delayed responders." 
Profiles of some GA responsive genes are shown in the Table below with examples of 
associated biological activities. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY . 


Up regulated 


Early responders to 


GA perception 


Transcription factors 


transcripts 


GA 


GA transport 


Transporters 


(level at 1 hr « 6 hr) 




Modulation of GA 


Change in cell 


(level at 1 hr>6hr) 


Genes induced by 


response 


membrane structure 




GA 


transduction 


Kinases and 






pathways 


phosphatases 






Specific gene 








transcription 


Transcription activators 






initiation 


Change in chromatin 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 






Growth stimulating 


structure and/or 






pathway induction 


localized DNA topology 








Cell wall proteins 








Metabolic Enzymes 


Up regulated 


Maintenance of GA 


Maintenance of 


Transcription factors 


transcripts 


response 


response to GA 


Specific factors 


(level at 1 hr < 6 hr) 




Induction of GA 


(initiation and 




"Delayed" responders 


metabolic pathways 


elongation) for protein 








synthesis 








Maintenance of mRNA 








stability 








Metabolic enzymes 


Down-regulated 


Early repressor 


Negative regulation 


Transcription factors 


transcripts 


responders to GA 


of GA pathways 


Calmodulin 


(level at 1 hr « 6 hr) 




released 


Change in protein 


(level at 6 hr > 1 hr) 


Genes repressed by 


Reduced activity of 


structure by phosphor- 




GA 


repressed pathways 


ylation (kinases) or 








dephosphoryaltion 




Genes whose 




(phosphatases) 




activities are 




Change in chromatin 




diminished or 




structure and/or DNA 




mRNAs are unsTable 




topology 




in the presence of GA 






Down-regulated 


Delayed responders 


Maintenance or GA 


Transcription factors 


transcripts 




repressed pathways 


Kinases and 


(level at 1 hr > 6 hr) 


Genes repressed by 




phosphatases 




GA 




Stability factors for 




Genes whose 




protein translation 




activities are 




Metabolic enzymes 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 




diminished or 
mRNAs are unsTable 
in the presence of GA 







Use of Promoters of GA responsive Genes 

Promoters of GA responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the GA responsive genes 
where the desired sequence is operably linked to a promoter of a GA responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response 
to the same stimuli as the protein product of the gene from which the promoter was derived. 
Such promoter are also useful to produce antisense mRNAs to down-regulate the product of 
proteins, or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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m. METABOLISM AFFECTING GENES, GENE COMPONENTS AND PRODUCTS 



m.A. NITROGEN RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

5 Nitrogen is often the rate-limiting element in plant growth, and all field crops have a 

fundamental dependence on exogenous nitrogen sources. Nitrogenous fertilizer which is usually 
supplied as ammonium nitrate, potassium nitrate, or urea, typically accounts for 40% of the costs 
associated with crops, such as corn and wheat in intensive agriculture. Increased efficiency of 
nitrogen use by plants should enable the production of higher yields with existing fertilizer 
10 inputs and/or enable existing yields of crops to be obtained with lower fertilizer input, or better 
yields on soils of poorer quality. Also, higher amounts of proteins in the crops could also be 
produced more cost-effectively. 

Changes in nitrogen concentration in the surrounding environment or in contact with a 
^ plant results in modulation of the activities of many genes and hence levels of gene products, 
is Examples of such "nitrogen responsive" genes and gene products with these properties are 
% shown in the Reference, Sequence, Protein Group, Protein Group Matrix tables, MA_diff, 
n r ; MA_clust, Knock-in and Knock-out tables. These genes and/or products are responsible for 
gi effects on traits such as plant vigor and seed yield. They were discovered and characterized 

from a much larger set by experiments designed to find genes whose mRNA products changed 
&-0 in response to changing levels of available nitrogen to plants. 

I* Manipulation of one or more "nitrogen responsive" gene activities are useful to 

modulate the biological activities and/or phenotypes listed below. "Nitrogen responsive" 
genes and gene products can act alone or in combination. Useful combinations include 
nitrogen responsive genes and/or gene products with similar transcription profiles, similar 

25 biological activities, or members of the same or functionally related biochemical pathways. 
Whole pathways or segments of pathways are controlled by transcription factor proteins and 
proteins controlling the activity of signal transduction pathways. Therefore, manipulation of 
the levels of such proteins is especially useful for altering phenotypes and biochemical 
activities of plants. The MA_diff Table(s) reports the transcript levels of the experiment (see 

30 EXPT ID: 108592, 108593, 108588, 108589, 108590, 108591, 108532, 108548, 108549, 

108550, 108551, 108454, 108455, 108487, 108488, 108489, and Nitrogen (relating to SMD 
3787, SMD 3789)). For transcripts that had higher levels in the samples than the control, a 
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"+" is shown. A "- ,T Ts shown for when transcript levels were reduced in root tips as 
compared to the control For more experimental detail see the Example section below. 

Nitrogen genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 




Nitrogen Genes Identified By Cluster Analyses Of Differential Expression 

Nitrogen Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Nitrogen genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108592, 108593, 108588, 108589, 
108590, 108591, 108532, 108548, 108549, 108550, 108551, 108454, 108455, 108487, 
108488, 108489, and Nitrogen (relating to SMD 3787, SMD 3789) of the MA_diff table(s). 

Nitrogen Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Nitrogen genes. A group in the MA_clust 
is considered a Nitrogen pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 

Nitrogen Genes Identified By Amino Acid Sequence Similarity 

Nitrogen genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Nitrogen 
genes. Groups of Nitrogen genes are identified in the Protein Group table. In this table, any 
protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
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Nitrogen pathway or network is a group of proteins that also exhibits Nitrogen 
functions/utilities. 

Such "nitrogen responsive" genes and gene products can function either to either 
increase or dampen the phenotypes and activities below, either in response to changes in 
nitrogen concentration or in the absence of nitrogen fluctuations. 

Further, promoters of nitrogen responsive genes, as described in the Reference tables, 
for example, are useful to modulate transcription that is induced by nitrogen or any of the 
following phenotypes or biological activities below. 

IHA.1.USE OF NITROGEN-RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

"Nitrogen responsive" genes and gene products can be used to alter or modulate one or 
more of the following phenotypes: 

- Plant Development 

- Initiation of the Reproduction Cycle from a Vegetative State 

- Flower Development Time 

- Time to Fruit Maturity; and 

- Root Development and Initiation 

- Root Branching 

- Lateral Root, Initiation and/or Development 

- Nodule formation and nitrogen Assimilation from any Nitrogen-Fixing 



Growth Rate 

- Whole Plant, including Height, Flowering Time, etc. 

- Organs 

- Flowers 

- Fruits 

- Stems 
Leaves 

- Roots 

- Lateral Roots 

- Biomass 



Symbions. 
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- Fresh and Dry Weight during any time in plant life, such as maturation; 

- Number, Size, and Weight of 

- Flowers; 

- Seeds; 

- Branches; 

- Leaves; 

Total Plant Nitrogen Content 

- Amino Acid/Protein Content of Whole Plant or Parts 

- Seed Yield 

- Number, Size, Weight, Harvest Index 

Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, 
and carbohydrate 

- Fruit Yield 

- Number, Size, Weight, Harvest Index 

Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, 
carbohydrate, Water 

To regulate any of the phenotype(s) above, activities of one or more of the nitrogen 
responsive genes or gene products can be modulated and the plants can be tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be screened for 
variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed, for example, in 
accordance to Zhang (1999) Proc. Natl. Acad. Sci. 96(1 1): 6529-34; or Zhang and Forde 
(1998) Science 279(5349):407-9; Scheible, W., Lauerer, M., Schultze, E.-D., Caboche, M., 
and Sitt, M. (1997). Plant J. 11, 671-691; Chevalier C, Bourgeois E, Just D, Raymond P. 
Plant J. 1996 Jan;9(l):l-ll. 

m.A.2.USE OF NITROGEN-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the nitrogen responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities are documented and can be measured 
according to the citations included in the Table below: 
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Process 


Biochemical Or Metabolic 
Activities And/Or Pathways 


Citations including assays 


Nitrate And Ammonium 
Uptake and Assimilation 


N0 3 " Influx And Efflux 


Lejayetal. (1999) Plant J. 18(5): 
509-519 




Nitrate Channels 


Liu et al. (1999) Plant Cell 1 1 : 865- 
874; and 

Wang et al.(1998) Proc. Natl. Acad. 
Sci. USA 95: 15134-15139 




Changes In Membrane- 
Potential 


Meharg et al. (1995) J. Membr. 
Biol. 145: 49-66; and 
Wang et al. (1998), supra 


Amino Acid Synthesis 


Glutamine Synthesis And 
Then Biosynthesis Of Other 
Amino Acids 


Coruzzi et al. U.S. Pat. No. 
5,955,651; and 

Oliveira et al. (1999) Plant. Phys. 
121:301-309 




Asparagine Synthesis And 
Then Biosynthesis Of Other 
Amino Acids 


Lam et al. (1998) Plant J. 16(3): 345- 
353 


Coordination Of Carbon 
And Nitrogen Metabolism 


Light-Regulation Of Major 
Central Carbon And 
Nitrogen Metabolic 
Pathways To Coordinate 
Growth 


Lam et al. (1998), supra; 
Lejay et al. (1999), supra; and 
Oliveira et al. (1999), supra 




Carbohydrate And Nitrogen 
Control Of Carbohydrate 
And Organic Nitrogen 
Accumulation Pathways 


Lam et al. (1998) supra; 
Lejayetal. (1999) supra; and 
Oliveira et al. (1999) supra 


Nitrogen Loading And 
Unloading 


Nitrogen Transport From 
Source To Sinks 


Walker et al. (1999) 210(1):9-18 
Elsheikh et al. (1997) 51(2):137-44. 
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Process 


Biochemical Or Metabolic 
Activities And/Or Pathways 


Citations including assays 


Nitrogen Storage 


Accumulation Of Amino 
Acids And/Or Storage 
Proteins In Vacuoles 


Johnson et al. (1990) Plant Cell 
2(6):525-32. 

Herman and Larkins (1999) Plant 
Cell. 11(4):601-14. 


Ammonium 
Detoxification 


Plastid Ammonium 

Storage/Glutamine 

Synthesis 


Crawford (1995) Plant Cell 
7(7):859-68. 

Zhang and Forde (1998) Science 
279: 407-409. 


Cell Growth 


Division And/Or Elongation 


Zhang and Forde (1998) Science 

279: 407-409. 

Cbruzzi et al. U.S. Pat. No. 

5,955,651 



Other biological activities that can be modulated by the nitrogen responsive genes and 
their products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Domain section above. 

Nitrogen responsive genes are characteristically differentially transcribed in response 
to fluctuating nitrogen levels or concentrations, whether internal or external to an organism or 
cell. The MA_diff table reports the changes in transcript levels of various nitrogen 
responsive genes in the aerial parts of a seedling at 2, 6, 9 and 12 hours after a plant was 
sprayed with a solution enriched with ammonium nitrate as compared to seedlings sprayed 
with water. The MA_diff reports the changes in transcript levels of various nitrogen 
responsive genes in roots at 12 and 24 hours that were cut from seedlings transferred from a 
high to low potassium nitrate environment compared to control seedlings transferred to a high 
potassium nitrate environment. 

The data from this time course reveal a number of types of nitrogen responsive genes 
and gene products, including "early responders," and "delayed nitrogen responders" 
Profiles of the individual categories of nitrogen responsive genes are shown in the Tables 
below together with examples of the kinds of associated biological activities that are 
modulated when the activities of one or more such genes vary in plants. 
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Low to High Ammonium Nitrate Experiment 



Gene Expression 


Functional 


Physiological 


Examples Of Gene Products 


Levels 


Category Of 


Consequences 






Gene 




* 


Upregulated 


Early 


- Perception Of 


- Transcription Factors 


Transcripts 


Responders To 


Nitrogen 


- Transporters 


(Level At 2h = 6, 


Nitrogen 


- Induced Nitrogen 


- Inhibitors Of Nitrogen 


9 Or 12h) Or 




Uptake Into Cells 


Fixation 


(Level At 2h > 6, 




- Induction Of Nitrogen 


- Components Of Pathways 


9 Or 12h) 




Response Transduction 


Released From Repression 






Pathways 


- Transaminases 






- Initiation Of Specific 


- Amino Acid Biosynthetic 






Gene Transcription 


Enzymes 


Upregulated 


Delayed 


- Maintenance Of High 


- Nitrogen Metabolic 


Transcrints 


Nitrogen 


Nitrogen Metabolism 


Pathway Enzymes 


(Level At 2h < 6, 


Responders 


And Growth 


- Transaminases 


9, Or 12h 






- Amino Acid Biosynthetic 








Enzymes 








- Factors Induced In 








Coordination And Control 








Of Central Carbon And 








Nitrogen Metabolism 








- Cell Wall And Cell 








Growth- Promoting 








Pathway Enzymes 








- Storage Proteins 
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Gene Expression 


Functional 


Physiological 


Examples Of Gene Products 


Levels 


Category Of 


Consequences 






Gene 






Down Regulated 


- Early 


- Negative Regulation 


- Transcription Factors 


Transcripts 


Responder 


Of Nitrogen Utilization 


- Kinases And Phosphatases 


(Level At 2h = 6, 


Repressors Of 


Pathways Released 


- Cytoskeletal Proteins 


9 Or 12h) Or 


Nitrogen 




Modulating Cell Structure 


(Level At 6, 9 Or 


Utilization 


-Pathways OfC AndN 


- Chromatin Structure 


12h>2h) 


Pathways 


Metabolism Required 


Regulatory Proteins 






At Lower Levels 


- Metabolic Enzymes 




- Genes With 


Decline In Presence Of 


-Transporters 




Discontinued 


High Nitrogen 


- Proteins And Rna 




Expression Or 




Turnover Systems 




UnsTable Mrna 








Following 








Nitrogen 








Uptake 






Level At 2 Hours 


- Delayed 


- Negative Regulation 


- Transcription Factors 


> 6,9 Or 12 


Response 


Of Nitrogen Utilization 


- Kinases And Phosphatases 


Hours 


Repressors Of 


Pathways Released 


- Cytoskeletal Proteins 




Nitrogen 




Modulating Cell Structure 




Utilization 


-Pathways OfC AndN 


- Chromatin Structure 




Pathways 


Metabolism Required 


Regulatory Proteins 




- Genes With 


At Lower Levels 


- Metabolic Enzymes 




Discontinued 


Decline In Presence Of 


- Transporters 




Expression Or 


High Nitrogen 


- Protein And Rna Turnover 




UnsTable Mrna 




Systems 




Following 








Nitrogen 








Uptake 
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High to Low Potassium Nitrate Experiments 



Gene Expression 


Functional 


Type Of Biological 


Examples Of Biochemical 


Levels 


Category Of Gene 


Activity 


Activities Of Gene 








Products. 


Upregulated 


Early Responders 


- Perception Of Low 


- Transcription Factors — 


Transcripts (Level 


To Low Nitrate 


Nitrate 


Controlling Transcription 


At 12h~24h) 




- Nitrogen Uptake Into 


- Transporters — Facilitating 


(Level At 




Cells 


Transport 


12h>24h) 




- Low Nitrogen Signal 


- Cell Wall/Membrane 






Transduction Response 


Structure Determining 






Pathways 


Proteins 






- Initiation Of Specific 


Kinases And 






Gene Transcription 


Phosphatases- 






- Initiation Of Nitrogen 


Reeulatine Sienal 






Fixati on 


Transduction Pathwavs 

A 1 lUXUViViV ilVll J. 14111 TT T 








Cvtoskeletal 








Proteins- Modulating Cell 








Structure 








Chromatin 








Structure And/Or Dna 








Topology Proteins 








- Protein-Protein 








Interaction Participants 








- Metabolic Enzymes- 








Nitrogen Turnover 








Enzymes And Pathway 








Components 
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Gene Expression 


Functional 


Type Of Biological 


Examples Of Biochemical 


Levels 


Category Of Gene 


Activity 


Activities Of Gene 








Products 


Unregulated 


Delayed Low 


- Maintenance Of Low 


- Transcription Factors - 


Transcripts 


Nitrate 


Nitrogen Response 


Controlling Transcription 


( Level 12h<24h) 

1 1 J ▼ \S 1. X Mil 1 11 I 


Resnonders 


Pathwavs (See the Table 


- TransDorters — Facilitating 






Above) 


Transport 








- Cell Wall/Membrane 








Structure Determining 








Proteins 








Kinases And 








Phosphatases- 








Regulating Signal 








Transduction Pathways 








Cytoskeletal 








Proteins- Modulating Cell 








Structure 








Chromatin 








Structure And/Or Dna 








Topology Proteins 








- Protein-Protein 








Interaction Participants 








- Metabolic Enzymes- 








Nitrogen Turnover 








Enzymes And Pathway 








Components 
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Gene Expression 


Functional 


Type Of Biological 


Examples Of Biochemical 


Levels 


Category Of Gene 


Activity 


Activities Of Gene 








Products 


Down-Regulated 


- Early Repressor 


-Negative Regulation Of 


- Transcription Factors 


Transcripts (Level 


Responders To 


Low Nitrogen-Mediated 


Cell 


At 12h~24h) 


Low Nitrate 


Pathways And/Or 


Wall/Membrane Structure 


(Level At 




Responses Released 


Determining Proteins 


12h>24h) 


- Genes Whose 


- Pathways In C And N 


Factors For 




Expression Is 


Metabolism Required At 


Promoting Protein 




Discontinued Or 


Lower Levels Decline In 


Translation 




Mrna Is UnsTable 


The Presence Of Low 


Kinases And 




In Presence Of 


Nitrate 


Phosphatases 




Low Nitrate 




- Cytoskeletal Proteins- 








Modulating Cell Structure 








- Protein And Rna 








Turnover Systems 


Down-Regulated 


- Delayed 


Negative Regulation Of 


- Transcription Factors 


Transcripts 


Repressor 


Low Nitrogen-Mediated 


Cell 


(Level At 


Responders To 


Pathways And/Or 


Wall/Membrane Structure 


12h<24h) 


Low Nitrate 


Responses Released 


Determining Proteins 






Pathways In C And N 


Factors For 




- Genes Whose 


Metabolism Required At 


Promoting Protein 




Expression Is 


Lower Levels Decline In 


Translation 




Discontinued Or 


The Presence Of Low 


Kinases And 




mRNAIs 


Nitrate 


Phosphatases 




UnsTable In 




- Cytoskeletal Proteins- 




Presence Of Low 




Modulating Cell Structure 




Nitrate 




- Protein And Rna 








Turnover Systems 








- Chromatin Structure 








And/Or Dna Topology 








Proteins 
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Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the nitrogen responsive genes when the desired sequence is 
operably linked to a promoter of a nitrogen responsive gene. 



Page 319 of 772 



• 




Attorney Diet. ^7MM4S3f 
Client Dkt. 80090.005 



m.B. CIRCADIAN RHYTHM RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress 
conditions, including water loss. To combat such conditions, plant cells deploy a battery of 
responses that are controlled by an internal circadian clock, including the timed movement of 
cotyledons and leaves, timed movements in guard cells in stomata, and timed biochemical 
activities involved with sugar and nitrogen metabolism. These responses depend on the 
functioning of an internal circadian clock, that becomes entrained to the ambient light/dark 
cycle, and a series of downstream signaling events leading to transcription independent and 
transcription dependent stress responses. 

A functioning circadian clock can anticipate dark/light transitions and prepare the 
physiology and biochemistry of a plant accordingly. For example, expression of a chlorophyll 
a/b binding protein (CAB) is elevated before daybreak, so that photosynthesis can operate 
maximally as soon as there is light to drive it. Similar considerations apply to light/dark 
transitions and to many areas of plant physiology such as sugar metabolism, nitrogen 
metabolism, water uptake and water loss, flowering and flower opening, epinasty, 
germination, perception of season, and senescence. 

Manipulation of one or more clock gene activities are useful to modulate the 
biological processes and/or phenotypes listed below. Clock responsive genes and gene 
products can act alone or in combination. Useful combinations include clock responsive 
genes and/or gene products with similar transcription profiles, similar biological activities, or 
members of the same or functionally related biochemical pathways. Whole pathways or 
segments of pathways are controlled by transcription factor proteins and proteins controlling 
the activity of signal transduction pathways. Therefore, manipulation of such protein levels is 
especially useful for altering phenotypes and biochemical activities of plants. The MA_diff 
Table(s) reports the transcript levels of the experiment (see EXPT ID: Circadian (relating to 
SMD 2344, SMD 2359, SMD 2361, SMD 2362, SMD 2363, SMD 2364, SMD 2365, SMD 
2366, SMD 2367, SMD 2368, SMD 3242)). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 
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Circadian genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 

5 Circadian Genes Identified By Cluster Analyses Of Differential Expression 

Circadian Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
10 expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Circadian genes is any group in the MA_clust that 
C comprises a cDNA ID that also appears in Expt ID Circadian (relating to SMD 2344, SMD 
B 2359, SMD 2361, SMD 2362, SMD 2363, SMD 2364, SMD 2365, SMD 2366, SMD 2367, 
J SMD 2368, SMD 3242) of the MA_diff table(s). 

: sr. 

u\ 

rf; Circadian Genes Identified By Correlation To Genes That Cause 

JU Physiological Consequences 

Wb Additionally, the differential expression data and the phenotypic observations 

F: : 

: ; : 

4* can be merged to identify pathways or networks of Circadian genes. A group in the 

h J MA_clust is considered a Circadian pathway or network if the group comprises a cDNA ID 

that also appears in Knock-in or Knock-out tables that causes one or more of the phenotypes 

described in section above. 

25 

Circadian Genes Identified By Amino Acid Sequence Similarity 

Circadian genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Circadian 
genes. Groups of Circadian genes are identified in the Protein Group table. In this table, any 
30 protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
Circadian pathway or network is a group of proteins that also exhibits Circadian 
functions/utilities. 
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Such clock responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in daylength or. in 
response to changes in light quality. Further, promoters of cirdadian (clock)responsive genes, 
as described in the Reference tables, for example, are useful to modulate transcription that is 
induced by circadian or any of the following phenotypes or biological activities below. 
Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the circadian (clock) responsive genes when the desired 
sequence is operably linked to a promoter of a circadian (clock) responsive gene. 

The expression of many genes is modulated by the clock. Microarray technology 
allows monitoring of gene expression levels for thousands of genes in a single experiment. 
This is achieved by hybridizing labeled fluorescent cDNA pools to glass slides that contain 
spots of DNA (Schena et al. (1995) Science 270: 467-70). The US Arabidopsis Functional 
Genomics Consortium (AFGC) has recently made public the results from such microarray 
experiments conducted with AFGC chips containing some 10,000 non-redundant ESTs, 
selected from about 37,000 randomly sequenced ESTs generated from mRNA of different 
tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in 
response to the circadian rhythm clock at various times through the 24 hour cycle relative to 
the controls were identified, compared to the Ceres full length cDNA and genomic sequence 
databanks, and equivalent Ceres clones identified. The MA_diff table reports the results of 
this analysis, indicating those Ceres clones which are up or down regulated over controls, 
thereby indicating the Ceres clones which represent clock responsive genes. 

IHB.l.USE OF CIRCADIAN RHYTHM (CLOCK) RESPONSIVE GENES 
TO MODULATE PHENOTYPES 
Clock responsive genes and gene products are useful to or modulate one or more of 
the following phenotypes: 

• Timing Phenotypes 

• Dormancy 

• Germination 

• Cotyledon opening 

• First leaves 
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• Juvenile to adult transition 

• Bolting 

• Flowering 

• Pollination 

• Fertilization 

• Seed development 

• Seed set 

• Fruit Drop 

• Senescence 
Epinasty 

• Biomass 

• Fresh and Dry Weight during any time in plant life, such as maturation 

• Number of Flowers, Seeds, Branches, and/or Leaves 

• Seed Yield, including Number, Size, Weight, and/or Harvest Index 

• Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

• Plant Development 

• Time to Fruit Maturity 

• Cell Wall Strengthening and Reinforcement 

• Stress Tolerance 

• Drought tolerance 

• Flooding tolerance 

• UV tolerance 



To regulate any of the phenotype(s) above, activities of one or more of the clock 
responsive genes or gene products can be modulated and the plants can be tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be screened for 
variants as in Anderson et al. (1997) Plant Cell 9: 1727-1743; Heintzen et al. (1997) Proc. 
Natl. Acad. ScL USA 94: 8515-20; Schaffer et al. (1998) Cell 93:1219-1229; Somers et al. 
(1998) Development 125: 485-494; Somers et al. (1998) Science 282: 1488-1490; Wang and 
Tobin (1998) Cell 93: 1207-1217; Zhong et al. (1998) Plant Cell 10: 2005-2017; Sugano et 
al. (1998) Proc. Natl. Acad. Sci. USA 95: 1 1020-1 1025; Dowson-Day and Millar (1999) 
Plant J 17: 63-71; Green and Tobin (1999) Proc. Natl. Acad. Sci. USA 96: 4176-419; Staiger 
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and Apel (1999) Mol. Gen. Genet. 261: 811-819; Strayer and Kay (1999) Curr. Opin. Plant 
Biol. 2:1 14-120; Strayer et. al. (2000) Science 289:768-771; Kreps et al. (2000) J Biol 
Rhythms (2000) 15:208-217; Nelson et al. (2000) Cell 101:331-340; Somers et al. (2000) 
Cell 101:319-329. 

m.B.2. USE OF ACTIVE CLOCK RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 
The activities of one or more of the clock responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities are documented and can be measured according to the citations above 
and included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Germination and seedling 
development 


Cold, light and water modulated 
signal transduction pathways, 
receptors, kinases, PAS domain 


Bognaretal. (1999) Proc. 
Natl. Acad. Sci. USA 
96:14652-14657; Sugano et al 
(1999) Proc. Natl. Acad. Sci. 
USA 96:12362-12366; 
Dowson-Day and Millar 
(1999) Plant J 17: 63-71; 
Somers et al. (2000) Cell 
101:319-329; Zhongetal. 
(1998) Plant Cell 10:2005- 
2017 


Growth transitions and 
flowering 


Cold and light modulated signal 
transduction pathways, 
receptors, kinases, PAS domain 


Nelson et al. (2000) Cell 
101:331-340; Fowler etal. 
(1999)EMBOJ. 18:4679- 
4688 


Tuber formation 


Cold and light modulated signal 
transduction pathways 


Yanovsky et al. (2000) Plant 
J. 23: 223-232 



Page 324 of 772 




Attorney Dkt. 2750-1483P 
Client Dkt. 80090.005 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


METABOLISM 






Lipid metabolism 


Membrane lipid synthesis 
including omega-3 fatty acid 
desaturase, lipases, lipid 
transfer proteins 


Bradley and Reddy (1997) J. 
Bacterid. 179: 4407-4410; 
Martin, M et al. 1999 Europe 
J. Biochem 262: 283-290 


Sugar metabolism 


Glycosylhydrolases, 
glycosyltransferases, amylases, 
sucrose synthase, CAB, 
Rubisco, light signal 
transduction 


Liu etal. (1996) Plant 
Physiol. 112:43-51; Millar 
and Kay (1996) Proc Natl 
AcadSciUS A 93:15491- 
15496; Wang etal. (1997) 
Plant Cell 9:491-507; 
Shinohara et al (1999) J. Biol. 
Chem. 273: 446-452 


Nitrogen metabolism 


Aminotransferases, arginase, 
proteases and vegetative storage 
proteins, aromatic amino acid 
synthesis 


Bradley and Reddy (1997) J. 
Bacterid. 179: 4407-4410 


Photorespiration 


Mitochondrial, chloroplast and 
peroxisomal photorespiratory 
enzymes, serine hydroxymethyl 
transferases, catalase 


Zhong and McClung (1996) 
Mol. Gen. Genet. 251:196- 
203; McClung (1997) Free. 
Radic. Biol. Med. 23:489- 
496; McClung et al. (2000) 
Plant Physiol. 123:381-392 


Responses to 
Environmental Stress 


Expression of genes involved in 
responses to drought, salt, UV 


McClung (1997) Free Radic 
Biol Med 23:489-496; Shi et 
al. (2000) Proc. Natl. Acad. 
Sci. USA 97:6896-6901 
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Other biological activities that can be modulated by the clock responsive genes and 
their products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Clock responsive genes are characteristically differentially transcribed in response to 
fluctuations in an entrained oscillator, which is internal to an organism and cell. The 
MA diff table(s) report(s) the changes in transcript levels of various clock responsive genes 
in a plant. 

Profiles of clock responsive genes are shown in the table below with examples of 
which associated biological activities are modulated when the activities of one or more such 
genes vary in plants. 



TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 


Up regulated 


Responders to 


• Circadian rhythm 




transcripts 


circadian rhythm 


perception 


• Metabolic enzymes 






• Metabolisms 


• Change in cell 






affected by 


membrane structure 




Genes induced by 


Circadian rhythm 


and potential 




rythm 


Synthesis of 


• Kinases and 






secondary 


phosphatases 






metabolites 


• Transcription 






and/or proteins 


activators 






• Modulation of 


• Change in 






clock response 


chromatin structure 






transduction 


and/or localized 






pathways 


DNA topology 






Specific gene 


• Enzymes in lipid, 






transcription 


sugar and nitrogen 






initiation 


metabolism 








• Enzymes in 








photorespiration 








and photosynthesis 



Page 326 of 772 



Attorney JJKt. 

Client Dkt. 80090.005 











Down-regulated 


Responders to 


• Negative 


• Transcription 


transcripts 


circadian rhythm. 


regulation of 


factors 






circadian 


• Change in protein 




Repressors of 


pathways released 


structure by 




circadian "state" of 




phosphorylation 




metabolism 


Changes in 


(kinases) or 






pathways and 


dephosphoryaltion 




Genes repressed by 


processes 


(phosphatases) 




rhythm 


operating in cells 


Change in 








chromatin structure 




Genes with 


• Changes in 


and/or DNA 




discontinued 


metabolism other 


topology 




expression or 


than circadian 


Stability of factors 




unsTable mRNA in 


pathways 


for protein 




presence of zinc 




synthesis and 








degradation 








• Metabolic enzymes 








in light, sugar, lipid 








and nitrogen 








metabolism 



Use of Promoters of Clock responsive Genes 

Promoters of Clock responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Clock responsive 
genes where the desired sequence is operably linked to a promoter of a Clock responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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m.C. BLUE LIGHT (PHQTOTROPISM) RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS 

Phototropism is the orientation or growth of a cell, an organism or part of an organism 
in relation to a source of light. Plants can sense red (R), far-red (FR) and blue light in their 
environment and respond differently to particular ratios of these. For example, a low R:FR 
ratio enhances cell elongation and favors flowering over leaf production, but blue light 
regulated cryptochromes also appear to be involved in determining hypocotyl growth and 
flowering time. 

Phototropism of Arabidopsis thaliana seedlings in response to a blue light source is 
initiated by nonphototropic hypocotyl 1 (NPH1), a blue light-activated serine-threonine 
protein kinase, but the downstream signaling events are not entirely known. Blue light 
treatment leads to changes in gene expression. These genes have been identified by 
comparing the levels of mRNAs of individual genes in dark-grown seedlings, compared with 
in dark grown seedlings treated with 1 hour of blue light. Auxin also affects blue light 
phototropism. The effect of auxin on gene expression stimulated by blue light has been 
explored by studying mRNA levels in a mutant of Arabidopsis thaliana nph4-2, grown in the 
dark and, treated with blue light for 1 hour compared with wild type seedlings treated 
similarly. This mutant is disrupted for auxin-related growth and auxin-induced gene 
transcription. Gene expression was studied using microarray technology. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results 
from such microarray experiments conducted with AFGC chips containing some 10,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from 
mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and the equivalent Ceres clones identified. The MA_diff table(s)report(s) the 
results of this analysis, indicating those Ceres clones which are up or down regulated over 
controls, thereby indicating the Ceres clones which represent blue light responsive genes and 
of those which are blue light responsive in the absence of nph4 gene activity. The MA_diff 
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Table(s) reports the transcript levels of the experiment (see EXPT ID: Phototropism (relating 
to SMD 4188, SMD 6617, SMD 6619)). For transcripts that had higher levels in the samples 
than the control, a "+" is shown. A "-" is shown for when transcript levels were reduced in 
root tips as compared to the control. For more experimental detail see the Example section 
below. 

Blue Light genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 



10 Blue Light Genes Identified By Cluster Analyses Of Differential Expression 

Blue Light Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
5 well correlated. Applicants not only have identified the genes that are differentially 
C5 expressed in the microarray experiments, but also have identified the genes that act in concert 
S with them. The MA_clust table indicates groups of genes that have well correlated 

| If transcription profiles and therefore participate in the same pathway or network. 

yj 

fn A pathway or network of Blue Light genes is any group in the MA_clust that 

~ comprises a cDNA ID that also appears in Expt ID Phototropism (relating to SMD 4188, 

lb SMD 6617, SMD 6619) of the MA_diff table(s). 

si i 
*« : 

Blue Light Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
25 can be merged to identify pathways or networks of Blue Light genes. A group in the 

MA_clust is considered a Blue Light pathway or network if the group comprises a cDNA ID 
that also appears in Knock-in or Knock-out tables that causes one or more of the phenotypes 
described in section above. 

30 Blue Light Genes Identified By Amino Acid Sequence Similarity 

Blue Light genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Blue Light 
genes. Groups of Blue Light genes are identified in the Protein Group table. In this table, 
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any protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
Blue Light pathway or network is a group of proteins that also exhibits Blue Light 
functions/utilities. 

m.C.l.USE OF BLUE LIGHT RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
PHENOTYPES 

Changes in blue light in a plant's surrounding environment result in modulation of many 
genes and gene products. Examples of such blue light response genes and gene products are 
shown in the REFERENCE and SEQUENCE Tables. These genes and/or products are 
responsible for effects on traits such as plant vigor and seed yield. 

While blue light responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different blue light responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the 
same or similar biochemical pathways. Whole pathways or segments of pathways are 
controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. In addition, the combination of a 
blue light responsive polynucleotides and/or gene product with other environmentally 
responsive polynucleotide is also useful because of the interactions that exist between 
hormone regulated pathways, stress and pathogen induced pathways, nutritional pathways 
and development. Here, in addition to polynucleotides having similar transcription profiles 
and/or biological activities, useful combinations include polynucleotides that may have 
different transcription profiles but which participate in common or overlapping pathways. 

IILC.2.USE OF BLUE LIGHT RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
PHENOTYPES 

Blue light responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in blue light 
response concentration or in the absence of blue light responsive fluctuations. Further, 
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promoters of blue light responsive genes, as described in the Reference tables, for example, 
are useful to modulate transcription that is induced by blue light or any of the following 
phenotypes or biological activities below. Further, any desired sequence can be transcribed in 
similar temporal, tissue, or enviromentally specific patterns as the blue light responsive genes 
when the desired sequence is operably linked to a promoter of a blue light responsive gene. 

Blue light responsive genes and gene products can be used to alter or modulate one or 
more of the following phenotypes: 

- Growth 

- Roots 

- Elongation 

- gravitropism 

- Stems 

- Elongation 

- Development 

- Cell 

- Growth 

- Elongation 
Flower 

Flowering time 
Seedling 

- Elongation 

- Plant Yield 

Seed and fruit yield 




To regulate any of the phenotype(s) above, activities of one or more of the blue 
light responsive genes or gene products can be modulated and the plants tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can 
be altered in a plant utilizing the procedures described herein and the phenotypes can be 
J assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(1998, Methods. Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et 
al. (1998) Plant Physiol 118: 743-50 and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Liscum and Briggs (1995, Plant 
Cell 7: 473-85), Vitha et al. (2000, Plant Physiol 122: 453-61), Stowe-Evance et al. 



Page 331 of 772 




Attorney Ukt. ^"/30-148'iF 
Client Dkt. 80090.005 



(1998, Plant Physiol 118: 1265-75), Baum et al. (1999, PNAS USA 96: 13554-9), Huala 
et al. (1997) Science 278: 2120-2123), Kanegae et al. (2000, Plant Cell Physiol 41: 415- 
23), Khanna et al. (1999, Plant Mol Biol 39: 231-42), Sakai et al. (2000, Plant Cell 12: 
225-36), Parks et al (1996, Plant Physiol 1 10: 155-62) and Janoudi et al. (1997, Plant 
Physiol 113: 975-79). 

m.CJ.USE OF BLUE LIGHT RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
BIOCHEMICAL ACTIVITIES 
The activities of one or more of the blue light responsive genes can be modulated 
to change biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities can be measured according to the citations included in the Table 
below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Cell Growth and 
Development 


Cell Elongation 

• Seedling 

• Stem 


Liscum and Briggs (1995) Plant 
Cell 7: 473-85 




. Root 


Vitha et al. (2000) Plant Physiol 
122: 453-61 


Signalling 


UV light Perception 


Liscum and Briggs (1996) Plant 
Physiol 112:291-96 




Far-red/Red light Perception 


Parks et al. (1996) Plant Physiol 
110: 155-62 




Phosphorylation of cellular 
and nuclear-localized 
proteins 


Liscum and Briggs (1996) Plant 
Physiol 112: 291-96 




Activation and Synthesis of 
Transcription Factors 


Sakae et al. (2000) Plant Cell 12: 
225-36 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Ca+2 levels 


Baum et al. (1999) PNAS USA 
96: 13554-9 

Pu and Robinson (1998) J Cell 
Sci 111: 3197-3207 




Auxin Concentration 


Estelle(1998) Plant Cell 10: 
1775-8 

Reed et al. (1998) Plant Physiol 
118: 1369-78 




Inter-photoreceptors 


Janoudi et al. (1997) Plant 
Physiol 113:975-79 



Other biological activities that can be modulated by blue light response genes and then- 
products are listed in the REF Tables. Assays for detecting such biological activities are 
described in the Domain section of the REF Table. 

The specific genes modulated by blue light, in wild type seedlings and in the mutant 
deficient in transmitting auxin effects are given in the Reference and Sequence Tables . The 
kinds of genes discovered and some of their associated effects are given in the Table below. 



TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 


Up regulated 


Responders to no 


. Blue light 


• Transporters 


transcripts 


blue light in wild type 


perception 


• Metabolic enzymes 




or to blue light in 


• Metabolism 


• Change in cell 




mutant lacking auxin 


affected by blue 


membrane structure 




effects 


light 


and potential 






Synthesis of 


• Kinases and 






secondary 


phosphatases 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 






metabolites and/or 


• Transcription 






proteins 


activators 






Modulation of blue 


Change in chromatin 






light transduction 


structure and/or 






pathways 


localized DNA 






Specific gene 


topology 






transcription 








initiation 




Down-regulated 


Responders to no 


. Blue light 


• Transcription factors 


transcripts 


blue light in wild type 


perception 


• Change in protein 




or to blue light in 


• Metabolism 


structure by 




mutants lacking auxin 


affected by blue 


phosphorylation 




effects 


light 


(kinases) or 






Synthesis of 


dephosphorylation 






secondary 


(phosphatases) 




Genes with 


metabolites and/or 


Change in chromatin 




discontinued 


proteins 


structure and/or 




expression or 


• Modulation of blue 


DNA topology 




unsTable mRNA 


light transduction 


. Stability factors for 




during response 


pathways 


protein synthesis and 






Specific gene 


degradation 






transcription 


• Metabolic enzymes 






initiation 








Changes in 








pathways, and 








processes 








operating in cells 








• Changes in 








metabolic 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 






pathways other 
than phototropic 
blue light 
responsive 
pathways 





Use of Promoters of Blue Light responsive Genes 

Promoters of Blue Light responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Blue Light 
responsive genes where the desired sequence is operably linked to a promoter of a Blue Light 
responsive gene. The protein product of such a polynucleotide is usually synthesized in the 
same cells, in response to the same stimuli as the protein product of the gene from which the 
promoter was derived. Such promoter are also useful to produce antisense mRNAs to down- 
regulate the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via 
sense suppression. 
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m.D. RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

There has been a recent and significant increase in the level of atmospheric carbon 
dioxide. This rise in level is projected to continue over the next 50 years. The effects of the 
increased level of carbon dioxide on vegetation are just now being examined, generally in 
large scale, whole plant (often trees) experiments. Some researchers have initiated 
physiological experiments in attempts to define the biochemical pathways that are either 
affected by and/or are activated to allow the plant to avert damage from the elevated carbon 
dioxide levels. A genomics approach to this issue, using a model plant system, allows 
identification of those pathways affected by and/or as having a role in averting damage due to 
the elevated carbon dioxide levels and affecting growth. Higher agronomic yields can be 
obtained for some crops grown in elevated CO2. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The 
U.S. Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the 
results from such microarray experiments conducted with AFGC chips containing about 
10,000 non-redundant ESTs, selected from about 37,000 randomly sequenced ESTs 
generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
grown in higher CO2 levels compared with plants grown at more normal CO2 levels, were 
compared to the Ceres full length cDNA and genomic sequence databanks, and equivalent 
Ceres clones were identified. The MA_diff table reports the results of this analysis, indicating 
those Ceres clones which are up or down regulated over controls, thereby indicating the 
Ceres clones cDNA sequences that change in response to CO2. 

Examples of CO2 responsive genes and gene products are shown in the Reference, 
Sequence, Protein Group, Protein Group Matrix tables, MA_diff and MA_clust tables. While 
CO2 responsive polynucleotides and gene products can act alone, combinations of these 
polynucleotides also affect growth and development. Useful combinations include different 
CO2 responsive polynucleotides and/or gene products that have similar transcription profiles 
or similar biological activities, and members of the same or similar biochemical pathways. 
Whole pathways or segments of pathways are controlled by transcription factor proteins and 
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proteins controlling the activity of signal transduction pathways. Therefore, manipulation of 
such protein levels is especially useful for altering phenotypes and biochemical activities of 
plants. 

Manipulation of one or more CO2 responsive gene activities are useful to modulate 
the biological processes and/or phenotypes listed below. CO2 responsive genes and gene 
products can act alone or in combination. Useful combinations include genes and/or gene 
products with similar transcription profiles, similar biological activities, or members of the 
same or functionally related biochemical pathways. Here, in addition to polynucleotides 
having similar transcription profiles and/or biological activities, useful combinations include 
polynucleotides that may have different transcription profiles but which participate in 
common or overlapping pathways. 

CO2 responsive genes and gene products can function to either increase or dampen the 
above phenotypes or activities. Further, promoters of CO2 responsive genes, as described in 
the Reference tables, for example, are useful to modulate transcription that is induced by CO2 
or any of the following phenotypes or biological activities below. Further, any desired 
sequence can be transcribed in similar temporal, tissue, or enviromentally specific patterns as 
the CO2 responsive genes when the desired sequence is operably linked to a promoter of a 
C0 2 responsive gene. The MA_diff Table(s) reports the transcript levels of the experiment 
(see EXPT ID: C02 (relating to SMD7561, SMD 7562, SMD 7261, SMD 7263, SMD 3710, 
SMD 4649, SMD 4650)). For transcripts that had higher levels in the samples than the 
control, a is shown. A "-" is shown for when transcript levels were reduced in root tips 
as compared to the control. For more experimental detail see the Example section below. 

C02 genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a or "-" indication 

CQ2 Genes Identified By Cluster Analyses Of Differential Expression 

CQ2 Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 
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A pathway or network of C02 genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID C02 (relating to SMD7561, SMD 7562, 
SMD 7261, SMD 7263, SMD 3710, SMD 4649, SMD 4650) of the MA_diff table(s). 



CQ2 Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of C02 genes. A group in the MA_clust is 
considered a C02 pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the pheno types described in section 
above. 



CQ2 Genes Identified By Amino Acid Sequence Similarity 

C02 genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis C02 genes. Groups 
of C02 genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a C02 pathway or 
network is a group of proteins that also exhibits C02 functions/utilities. 



ni.D.l.USE OF CQ2 RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

CO2 responsive genes and gene products are useful to or modulate one or 
more of the following pheno types: 

- Catabolism 

Energy Generation, ATP, etc. 

- Metabolism 
Carbohydrate Synthesis 

- Growth Rate 

- Whole Plant, including Height, Flowering Time, etc. 
Organs 

- Flowers 

- Fruits 
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w w 

- Stems 

- Leaves 

- Roots 

- Lateral Roots 
5 - Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation; 

- Number, Size, and Weight of 

- Flowers; 

- Seeds; 

10 - Branches; 

Leaves; 

- Total Plant Nitrogen Content 

- Amino Acid/Protein Content of Whole Plant or Parts 
C j - Seed Yield 

ig - Number, Size, Weight, Harvest Index 

- Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, and 
Hi- Carbohydrate 
ffi - Fruit Yield 

L - Number, Size, Weight, Harvest Index 

w 

SO - Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, Carbohydrate, 

1U 

£ Water 

'r : - Photosynthesis 

- Carbon Dioxide Fixation 
To improve any of the phenotype(s) above, activities of one or more of the CO2 

25 responsive genes or gene products can be modulated and the plants tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 

30 118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 

functionally assayed according to Saito et al. (1994, Plant Physiol. 106: 887-95), Takahashi et 
al (1997, Proc. Natl. Acad. Sci. USA 94: 1 1 102-07) and Koprivova et al. (2000, Plant 
Physiol. 122: 737-46). 
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m.D.2.USE OF CQ 2 RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 
The activities of one or more of the CO2 responsive genes can be modulated to change 
biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 



GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Cell Division 


Cell Cycle Control Genes 


Masle (2000) Plant 
Physiol. 122: 1399-1415 


Starch Biosynthesis 


Starch Biosynthesis 
Enzymes And Pathways 


Ludewig et al., (1998) 
FEBS Lett. 429: 147-151 


Photosynthesis 


Photosynthetic Enzymes, 
e.g., Rubisco 


Cheng et al., (1998) Plant 
Physiol 166: 715-723 


Respiration 


Energy Metabolism 
Pathways 


Musgrave et al., (1986) 
Proc. Natl. Acad. Sci. USA 
83: 8157-8161 


C0 2 Uptake 


Guard Cell Stomata 
Control Systems 


Allen et al., Plant Cell 
(1999) 11(9): 1785-1798 
Ichida et al., Plant Cell 
(1997) 9(10): 1843-1857 
Hedrich et al., EMBO J 
(1993) 12(3): 897-901 


Coordination Of Carbon 
And Nitrogen Metabolism 


Light-Regulation Of Major 
Central Carbon And 
Nitrogen Metabolic 
Pathways To Coordinate 
Growth 


Lam et al. (1998) Plant J. 
16(3): 345-353 
Lejay et al. (1999) Plant J. 
18(5): 509-519; and 
Oliveira et al. (1999) Plant. 
Phys. 121:301-309 




Carbohydrate And 


Lam et al. (1998) supra; 
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GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Nitrogen Control Of 
Carbohydrate And Organic 
Nitrogen Accumulation 
Pathways 


Lejay et al. (1999) supra; 
and 

Oliveira et al. (1999) supra 



Other biological activities that can be modulated by the CO2 responsive genes and 
gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

CO2 responsive genes are characteristically differentially transcribed in response to 
fluctuating CO2 levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff tables report the changes in transcript levels of various C0 2 responsive genes 
that are differentially expressed in response to high CO2 levels. 

Profiles of these different CO2 responsive genes are shown in the Table below with 
examples of associated biological activities. 



TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up Regulated 


Responders 


Changes In Generation Of 


• Transporters 


Transcripts 


To Higher 


ATP 


. Catabolic And 




Levels Of 


Changes In Catabolism 


Anabolic Enzymes 




C0 2 


And Anabolism Enzymes 


. Change In Cell 






and Pathways 


Membrane Structure 




Genes 


• Activation Of Krebs Cycle 


And Potential 




Induced By 


Specific Gene 


• Kinases And 




C0 2 


Transcription Initiation 


Phosphatases 






Changes In Carbohydrate 


Transcription 






Synthesis 


Activators And 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 






. Changes In Chloroplast 


Repressors 






Structure 


Change In 






• Changes In Photosynthesis 


Chromatin Structure 






Changes In Respiration 


And/Or Localized 








DNA Topology 








• Redox Control 


Down- 


Responders 


Changes In Pathways And 


Transcription 


Regulated 


To Higher 


Processes Operating In 


Factors 


Transcripts 


Levels Of 


Cells 


• Change In Protein 




C0 2 




Structure By 






• Changes In Catabolism and 


Phosphorylation 




Genes 


Anabolism 


(Kinases) Or 




Repressed By 




Dephosphorylation 




C0 2 


• Changes in Chloroplast 


(Phosphatases) 






Structure 


• Change In 








Chromatin Structure 








And/Or DNA 








Topology 








• Stability Of Factors 








For Protein 








Synthesis And 








Degradation 








• Metabolic Enzymes 



Use of Promoters of CQ2 responsive Genes 

Promoters of C02 responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the C02 responsive 
genes where the desired sequence is operably linked to a promoter of a C02 responsive gene. 
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The protein product of such a polynucleotide is usually synthesized in the same cells, in 
response to the same stimuli as the protein product of the gene from which the promoter was 
derived. Such promoter are also useful to produce antisense mRNAs to down-regulate the 
product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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m.E. MITOCHONDRIA ELECTRON TRANSPORT (RESPIRATION) GENES, 
GENE COMPONENTS AND PRODUCTS 

One means to alter flux through metabolic pathways is to alter the levels of proteins in 
the pathways. Plant mitochondria contain many proteins involved in various metabolic 
processes, including the TCA cycle, respiration, and photorespiration and particularly the 
electron transport chain (mtETC). Most mtETC complexes consist of nuclearly-encoded 
mitochondrial proteins (NEMPs) and mitochondrially-encoded mitochondrial proteins 
(MEMPs). NEMPs are produced in coordination with MEMPs of the same complex and 
pathway and with other proteins in multi- organelle pathways. Enzymes involved in 
photorespiration, for example, are located in chloroplasts, mitochondria, and peroxisomes and 
many of the proteins are nuclearly-encoded. Manipulation of the coordination of protein 
levels within and between organelles can have critical and global consequences to the growth 
and yield of a plant. Genes which are manipulated by interfering with the mtETC have been 
characterized using microarray technology. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results 
from such microarray experiments conducted with AFGC chips containing about 10,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from 
mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in the 
presence of the ETC inhibitor, 10 mM antimycin A compared with the control lacking 
antimycin A. were identified, compared to the Ceres full length cDNA and genomic sequence 
databanks, and equivalent Ceres clones identified. The MA_diff table reports the results of 
this analysis, indicating those Ceres clones which are up or down regulated over controls, 
thereby indicating the Ceres clones that represent respiration responsive genes. 

Examples of genes and gene products that are responsive to antimycin A block of 
respiration are shown in the Reference, Sequence, Protein Group, Protein Group Matrix, 
MA_diff and MA_clust tables. While respiration responsive polynucleotides and gene 
products can act alone, combinations of these polynucleotides also affect growth and 
development. Useful combinations include different respiration responsive polynucleotides 
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and/or gene products that have similar transcription profiles or similar biological activities, 
and members of the same or similar biochemical pathways. Here, in addition to 
polynucleotides having similar transcription profiles and/or biological activities, useful 
combinations include polynucleotides that may have different transcription profiles but which 
participate in common or overlapping pathways. Whole pathways or segments of pathways 
are controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. Manipulation of one or more 
respiration responsive gene activities are useful to modulate the biological processes and/or 
phenotypes listed below. 

Such respiration responsive genes and gene products can function to either increase or 
dampen the phenotypes or activities below. Further, promoters of respiration responsive 
genes, as described in the Reference tables, for example, are useful to modulate transcription 
that is induced by respiration or any of the following phenotypes or biological activities 
below. Further, any desired sequence can be transcribed in similar temporal, tissue, or 
environmentally specific patterns as the respiration responsive genes when the desired 
sequence is operably linked to a promoter of a respiration responsive gene. The MA diff 
Table(s) reports the transcript levels of the experiment (see EXPT ED: Mitchondria-Electron 
Transport (relating to SMD 8061, SMD 8063)). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Mitchondria-Electron Transport genes are those sequences that showed differential 
expression as compared to controls, namely those sequences identified in the MA_diff tables 
with a "+" or "-" indication 

Mitchondria-Electron Transport Genes Identified By Cluster Analyses Of Differential 
Expression 

Mitchondria-Electron Transport Genes Identified By Correlation To Genes 
That Are Differentially Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
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with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Mitchondria-Electron Transport genes is any group 
in the MA_clust that comprises a cDNA ID that also appears in Expt ED Mitchondria- 
Electron Transport (relating to SMD 8061, SMD 8063) of the MA_diff table(s). 

Mitchondria-Electron Transport Genes Identified By Correlation To Genes 
That Cause Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Mitchondria-Electron Transport genes. A 
group in the MA_clust is considered a Mitchondria-Electron Transport pathway or network if 
the group comprises a cDNA ID that also appears in Knock-in or Knock-out tables that 
causes one or more of the pheno types described in section above. 

Mitchondria-Electron Transport Genes Identified By Amino Acid Sequence 
Similarity 

Mitchondria-Electron Transport genes from other plant species typically 
encode polypeptides that share amino acid similarity to the sequences encoded by corn and 
Arabidopsis Mitchondria-Electron Transport genes. Groups of Mitchondria-Electron 
Transport genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Mitchondria-Electron 
Transport pathway or network is a group of proteins that also exhibits Mitchondria-Electron 
Transport functions/utilities. 



m.E.l.USE OF RESPIRATION RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Respiration responsive genes and gene products are useful to or modulate one or more 
of the following phenotypes: 

- Catabolism 

- Energy Generation, ATP, etc. 

- - Growth Rate 

- Whole Plant, including Height, Flowering Time, etc. 
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- Organs 

- Flowers 

- Fruits 

- Stems 
5 - Leaves 

- Roots 

- Lateral Roots 

- Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation; 
10 - Number, Size, and Weight of 

Flowers; 

- Seeds; 

- Branches; 

- Leaves; 

f5 - Total Plant Nitrogen Content 

21 - Amino Acid/Protein Content of Whole Plant or Parts 

^ - Seed Yield 

Si - Number, Size, Weight, Harvest Index 

- Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, and 
§S0 Carbohydrate 

- Fruit Yield 

£T - Number, Size, Weight, Harvest Index 

- Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, Carbohydrate, 
Water 

25 - Photosynthesis 

- Carbon dioxide fixation 
To improve any of the phenotype(s) above, activities of one or more of the respiration 
responsive genes or gene products can be modulated and the plants tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
30 plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
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functionally assayed according to Saito et al. (1994, Plant Physiol. 106: 887-95), Takahashi et 
al (1997, Proc. Natl. Acad. Sci. USA 94: 1 1 102-07) and Koprivova et al. (2000, Plant 
Physiol. 122: 737-46). 

m.E.2. USE OF RESPIRATION-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 
The activities of one or more of the respiration responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 
INCLUDING ASSAYS 


Respiration and 

energy-related 

processes 


Mitochondrial Electron 
Transport Chain 


Passam et al. (1973) 
Biochem Biophys. Acta 
325: 54-61 




Alternative oxidase pathway 


Saishoetal. (1997) Plant 
Mol. Biol. 35: 585-600 
Vanlerberghe and 
Mcintosh (1994) Plant 
Physiol. 105: 867-874 




ATP generation pathways 
ATP utilization pathways 


Mahler and Cordes (1966) 
In Biological Chemistry, 
Harper and Row 




Chloroplast energy related 
pathways 


Foyer etal. (1989) Arch. 
Biochem. Biophys. 268: 
687-697 

Mills et al. (1978) 
Biochem. Biophys. Acta 
504: 298-309 




Peroxisome energy related 
pathways 


01sen(1998) Plant mol. 
Biol. 38: 163-89 




Cytoplasmic energy related 


Roberts et al. (1995) Febs Letters 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 
INCLUDING ASSAYS 




pathways 


373: 307-309 


* 


Catabolism and Anabolism 


Mahler and Cordes (1966) In 
Biological Chemistry, Harper and 
Row 




Aerobic versus anaerobic 
pathways 


Mahler and Cordes (1966) In 
Biological Chemistry, Harper and 
Row 


Coordination of 
Carbon and Nitrogen 
Metabolism 


Light-regulation of maj or 
central carbon and nitrogen 
Metabolic pathways to 
coordinate growth 


Lam et ah (1998) Plant J. 16(3): 
345-353 

Lejay et al. (1999) Plant J. 18(5): 
509-519; and 

Oliveira et al. (1999) Plant. Phys. 
121:301-309 




Carbohydrate and nitrogen 
control of carbohydrate and 
organic nitrogen accumulation 
pathways 


Lam et al (1998) Plant J. 16(3): 
345-353 

Lejay et al. (1999) Plant J. 18(5): 
509-519; and 

Oliveira et al. (1999) Plant. Phys. 
121:301-309 



Other biological activities that can be modulated by the respiration genes and gene 
products are listed in the REF Tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

Respiration responsive genes are differentially expressed in response to inhibition of 
mitochondrial electron transport by antimycin A. The MA_diff table reports the changes in 
transcript levels of various respiration responsive genes that are differentially expressed in 
response to this treatment. 

Profiles of these different respiration genes are shown in the Table below with 
examples of associated biological activities. 



Page 349 of 772 




Attorney L)kt. 2731M48JF 
Client Dkt. 80090.005 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to 


Changes in 


• Transporters 


transcripts 


inhibition of 


generation of ATP 


• Catabolic and 




mitochondrial 


Alternate oxidase 


anabolic enzymes 




electron transport 


induction 


• Changes in cell 




respiration 


Changes in 


and organelle 






catabolic and 


membrane 






anabolic enzymes 


structures and 




Genes induced by 


and pathways 


potentials 




inhibition of 


Specific gene 


• Kinases and 




mitochondrial 


transcription 


phosphatases 




electron transport 


initiation 


• Transcription 






Changes in electron 


activators 






transport proteins 


• Change in 








chromatin 








structure and/or 








localized DNA 








topology 








• Redox control 


Down-regulated 


Responders to 


• Changes in ATP 


• Transcription 


transcripts 


inhibition of 


generating 


factors 




mitochondrial 


pathways 


. Change in protein 




electron transport 




structure by 






Changes in 


phosphorylation 




Genes repressed by 


pathways and 


(kinases) or 




inhibition of 


processes operating 


dephosphoryaltion 




mitochondrial 


in cells 


(phosphatases) 




electron transport 




Transporters 






Induction of 


• Catabolic and 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 






aerobic pathways 


anabolic enzymes 








• Changes in cell 






• Changes in 


and organelle 






catabolism and 


membrane 






anabolism 


structures and 








potentials 








Change in 








chromatin 








structure and/or 








localized DNA 








topology 








• changes 








• Stability factors 








for protein 








synthesis and 








degradation 








• Metabolic 








enzymes 






Changes in redox 
activities 


• Changes in redox 
enzymes 



Use of Promoters of Respiration Genes 

Promoters of Respiration genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Respiration genes 
where the desired sequence is operably linked to a promoter of a Respiration gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response 
to the same stimuli as the protein product of the gene from which the promoter was derived. 
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Such promoter are also useful to produce antisense mRNAs to down-regulate the product of 
proteins, or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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m.F. PROTEIN DEGRADATION GENES, GENE COMPONENTS AND 
PRODUCTS 

One of the components of molecular mechanisms that operate to support plant 
development is the "removal" of a gene product from a particular developmental circuit once 
the substrate protein is not functionally relevant anymore in temporal and/or spatial contexts. 
The "removal" mechanisms can be accomplished either by protein inactivation (e.g., 
phosphorylation or protein-protein interaction) or protein degradation most notably via 
ubiquitination-proteasome pathway. The ubiquitination-proteasome pathway is responsible 
for the degradation of a plethora of proteins involved in cell cycle, cell division, transcription, 
and signal transduction, all of which are required for normal cellular functions. 
Ubiquitination occurs through the activity of ubiquitin- activating enzymes (El), ubiquitin- 
conjugating enzymes (E2), and ubiquitin-protein ligases (E3), which act sequentially to 
catalyze the attachment of ubiquitin (or other modifying molecules that are related to 
ubiquitin) to substrate proteins (Hochstrasser 2000, Science 289: 563). Ubiquitinated 
proteins are then routed to proteasomes for degradation processing [2000, Biochemistry and 
Molecular Biology of Plants, Buchanan, Gruissem, and Russel (eds), Amer. Soc. of Plant 
Physiologists, Rockville, MD], The degradation mechanism can be selective and specific to 
the concerned target protein (Joazeiro and Hunter2001, Science 289: 2061; Sakamoto et al., 
2001, PNAS Online 141230798). This selectivity and specificity may be one of the ways that 
the activity of gene products is modulated. 



"Protein degradation" genes identified herein are defined as genes, gene components 
and products associated with or dependant on the ubiquitination - proteasome protein 
degradation process. . Examples of such "protein degradation" genes and gene products are 
shown in the Reference and Sequence Tables. The biochemical functions of the protein 
products of many of these genes are also given in the Reference, Sequence, Protein Group, 
Protein Group Matrix tables, MA_diff and MA_clust tables. Selected genes, gene 
components and gene products of the invention can be used to modulate many plant traits 
from architecture to yield to stress tolerance. 



m.F.l. 



IDENTIFICATION OF PROTEIN DEGRADATION GENES, GENE 



COMPONENTS AND PRODUCTS 
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"Protein Degradation" Genes, Gene Components And Products Identified By 
Phenotypic Observations 

"Protein degradation" genes herein were discovered and characterized from a much 
larger set of genes in experiments designed to find the genes associated with the increased 
number of lateral branches (and secondary inflorescences) that are formed per cauline node. 
In these experiments, "protein degradation" genes were identified using a mutant with these 
characteristics. The gene causing the changes was identified from the mutant gene carrying 
an inserted tag. The mutant plant was named 1 3B 1 2- 1 and the mutant was in the E2 
conjugating enzyme gene of the ubiquitination process. Compared to "wild-type" parental 
plants, the mutant plants exhibited multiple lateral stems per node and multi-pistillated 
flowers. For more experimental detail, see Example section below. 

Protein Degradation Genes, Gene Components And Products Identified By 
Differential Expression 

"Protein degradation" genes were also identified by measuring the relative levels of 
mRNA products in the mutant plant 13B12-1 lacking the E2 conjugating enzyme versus a 
"wild-type" parental plant. Specifically, mRNAs were isolated from 13B12-1 and compared 
with mRNAs isolated from wild-type plants utilizing microarray procedures. The MA_diff 
Table(s) reports the transcript levels of the experiment (see EXPT ID: 108451). For transcripts 
that had higher levels in the samples than the control, a "+" is shown. A "-" is shown for when 
transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

Protein Degradation genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 

Protein Degradation Genes Identified By Cluster Analyses Of Differential Expression 
Protein Degradation Genes Identified By Correlation To Genes That Are 

Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 

correlated. Applicants not only have identified the genes that are differentially expressed in the 
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microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Protein Degradation genes is any group in the 
5 MA_clust that comprises a cDNA ID that also appears in Expt ID 10845 1 of the MA_diff 
table(s). 

Protein Degradation Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

10 Additionally, the differential expression data and the phenotypic observations 

can be merged to identify pathways or networks of Protein Degradation genes. A group in the 
MA_clust is considered a Protein Degradation pathway or network if the group comprises a 
cDNA ID that also appears in Knock- in or Knock-out tables that causes one or more of the 

O phenotypes described in section above. 

% Protein Degradation Genes Identified By Amino Acid Sequence Similarity 

fy Protein Degradation genes from other plant species typically encode 

h\ polypeptides that share amino acid similarity to the sequences encoded by corn and Arabidopsis 

L_ Protein Degradation genes. Groups of Protein Degradation genes are identified in the Protein 

U 

f£0 Group table. In this table, any protein group that comprises a peptide ID that corresponds to a 
\» cDNA ID member of a Protein Degradation pathway or network is a group of proteins that also 
w exhibits Protein Degradation functions/utilities. 

These differentially expressed genes include genes associated with the degradation 

process and the genes whose expression is disturbed by the aberrant ubiquitination. 
25 Examples of phenotypes, biochemical activities, and transcription profiles that can be 

modulated using these genes, gene components and gene products are described above and 

below. 
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m.F.2. USE OF "PROTEIN DEGRADATION" GENES, GENE 

COMPONENTS AND PRODUCTS TO MODULATE PHENOTYPES 

The "protein degradation" genes, their components and products of the instant 
invention are useful for modulating one or more processes required for post-translational 
modification (e.g., ubiquitination) and degradation or inactivation of substrate proteins and 
also the pathways and processes that are associated with protein inactivation that are 
important for either or all of the following: (I) cell proliferation; (II) cell differentiation; and 
(HI) cell death. The "protein degradation" genes, gene components and gene products are 
useful to alter or modulate one or more of the following phenotypes: 

I. Cell Proliferation 

A. Cell properties 

Cell properties can be critically altered by the maintenance or not of 
regulatory proteins 

1. Cell size 

2. Cell division, rate and direction 

3. Cell elongation 

B. Plant size 

The following parts of a plant can be modulated by "protein degradation" genes, gene 
components or gene products to affect plant size: 



Roots 




(a) 


Primary 


(b) 


Lateral 


(c) 


Root hairs 


(d) 


Root cap 


(e) 


Apical meristem 


(f) 


Epidermis 


(g) 


Cortex 


(h) 


Stele 


Stem 
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(a) 


Phloem 


(b) 


Xylem 


(c) 


Nodes 


(d) 


Internodes 


(e) 


Leaves 


(f) 


Shoot apical meristem 


(g) 


Cauline 


(h) 


Rosette 


(0 


Petioles 


Flowers 


(a) 


Receptacle 


(b) 


Sepals, Petals, and Tepals 


(c) 


Androecium 


(d) 


Stamen 


(e) 


Anther 


(f) 


Pollen 


(g) 


Filament 


(h) 


Gynoecium 


(i) 


Carpel 


0) 


Ovary 


(k) 


Style 


(1) 


Stigma 


(m) 


Ovule 


(n) 


Pedicel and Peduncle 


Seeds 




(a) 


Placenta 


(b) 


Embryo 


(c) 


Cotyledon 


(d) 


Endosperm 


(e) 


Suspensor 


(f) 


Seed coat (testa) 


Fruits 




(a) 


Pericarp - thickness, textu 
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(b) Exocarp 

(c) Mesocatp 

(d) Encocarp 

n. CELL DIFFERENTIATION 
The intracellular levels of many proteins are regulated by ubiquitin-proteasome 
proteolysis. Without proper regulation of protein levels, normal cell differentiation can be 
altered. Examples of cell differentiation and development that are useful to modulated by the 
genes and gene products of this invention are as follows: 

A. Roots 

The polynucleotides and polypeptides of this invention can be used to control root 
structure and function. Examples are as follows: 

1. Size 

(a) Length of primary roots 

(b) Length of lateral roots 

2. Function (for more detail see Root section) 

B. Branching and stem formation 

1. Multiple pistils 

2. Multiple lateral stems or secondary inflorescence per cauline 
node 

3. Internode length 

C. Cell differentiation and/or development in response to hormones 
1 . Auxin 

IE. CELL DEATH 

Programmed cell death can result from specific and targeted degradation of critical 
substrate proteins (e.g., transcription factors, enzymes, and proteins involved in signal 
transduction). Thus, alteration of "protein degradation" genes, their gene products, and the 
corresponding substrate proteins that they are acting upon are useful to modulate the vigor 
and yield of the plant overall as well as distinct cells, organs, or tissues. Examples of traits 
that can be modulated by these genes and gene products include: 

A. Sterility or reproduction 

B. Seedling lethality 
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Uses Of Plants That Are Modified As Described Above 
Genes that control fundamental steps in regulatory pathways, such as protein 
inactivation, that in turn influence cascades and networks of other genes and processes are 
5 extremely useful. They and their component parts can be used selectively to manipulate 
development in specific cells, tissues and organs, including meristems when genes are 
designed to inactivate the normal genes only in specific cells, tissues and organs or to 
promote protein production where it is not normally produced. They can also be used to 
promote/control cell death. 
10 Other "protein degradation" genes described here are components of the pathways 

determining organ identity and phenotypes. These and their component parts are also useful 
for modifying the characteristics of specific cells, tissues and organs when regulated 
appropriately. Thus "protein degradation" genes have wide utility for achieving the 

Better plant survival by decreased lodging 
Better responses to high plant density 
Better stress tolerance 

Better animal (including human) nutrition values 
Improved dietary mineral nutrition 
More vigor, growth rate and yield 

1 . Growth rate 

(a) Whole plant, including height, flowering time, branching, etc. 

(b) Seedling 

(c) Coleoptile elongation 
25 (d) Young leaves 

(e) Flowers 

(f) Seeds 

(g) Fruit 

2. Yield 

30 (a) Biomass 

- Fresh and dry weight during any time in plant life, including maturation and 
senescence 

(b) Root/tuber yield 
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- Number, size, weight, harvest index 
- Content and composition, e.g. amino acid, jasmonate, oil, protein and starch 

(c) Number of flowers 

(d) Seed yield 

Number, size, weight, harvest index 

Content and composition, e.g. amino acid, jasmonate, oil, 

protein and starch 

(e) Fruit yield 

- Number, size, weight, harvest index, post harvest quality 
Content and composition, e.g. amino acid, jasmonate, oil, 
protein and starch 

To regulate any of the phenotype(s) above, activities of one or more of the "protein 
degradation" genes or gene products can be modulated and tested by screening for the desired 
trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing 
the procedures described herein and the phenotypes can be assayed. In addition, a synthetic 
molecule containing specific domains from "protein degradation" genes or gene product 
and/or in combination with other domains from gene products that are not necessarily related 
to protein degradation pathway can be constructed to target the degradation or inactivation of 
specific substrate proteins. As an example, a plant can be transformed according to Bechtold 
and Pelletier (1998, Methods. Mol. Biol. 82:259-266) and/or screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and visually inspected for the desired 
phenotype or metabolically and/or functionally assayed according to Dolan et al (1993, 
Development 119: 71-84), Dolan et al. (1997, Development 124: 1789-98), Crawford and 
Glass (1998, Trends Plant Science 3: 389-95), Wang et al. (1998, PNAS USA 95: 15134-39), 
Gaxiola et al. (1998, PNAS USA 95: 4046-50), Apse et al. (1999, Science 285: 1256-58), 
Fisher and Long (1992, Nature 357: 655-60), Schneider et al. (1998, Genes Devel 12: 2013- 
21) and Hirsch (1999, Curr Opin Plant Biol 2: 320-326). 
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m.F.3. USE OF PROTEIN DEGRADATION GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
BIOCHEMICAL ACTIVITIES 
One or more of the "protein degradation" genes and their components can be used to 
modulate biochemical or metabolic activities, processes and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included in the 
Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth, Differentiation and 
Development 


Auxin response 


Schwechheimer et al, Science 292: 
1379 (2001); 

Leyser et al, Nature 8: 161 (1993); 
Lasswell et al, Plant Cell 12: 2395 
(2000) 




Photomorphogenesis via leaf 
cells and meristems 


Schwechheimer et al, Science 292: 
1379 (2001) 




Apical dominance via shoot 
meristems 


Schwechheimer et al, Science 292: 
1379 (2001) 




Lateral root development via root 
meristem 


Xie et al, Genes Dev 14: 3024 
(2000) 




Hypocotyl, shoot elongation by 
hormone controlled process 


Nagpal et al, Plant Physiol 123: 563 
(2000) 


Gene Expression and related 
cellular processes 


mRNA stability 


Johnson et al, PNAS 97: 13991 
(2000); 


Gene activation 


Pham and Sauer, 289: 2357 (2000) 


Cell division and cell cycle 
control in meristems 


King et al, Cell 81: 279 (1995); 
Ciechanover et al, Cell 37: 57 
(1984); 

Finley et al, Cell 37: 43 (1984); 
Robzyk et al, Science 287: 501 
(2000) 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 






Chromatin remodeling 


Roest et al, Cell 86: 799 (1996) 




Post-translational modification 


Biederer et al, Science 278: 1806 




and organelle targeting of 


(1997) 




proteins 





Other biological activities that can be modulated by the "protein degradation 11 gene, 
gene components and products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 

m.F.4. USE OF PROTEIN DEGRADATION GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
TRANSCRIPTION LEVELS OF OTHER GENES 
The expression of many genes is "up regulated 55 or "down regulated" in the 13B 12-1 
mutant because some protein degradation genes and their products are integrated into 
complex networks that regulate transcription of many other genes. Some protein degradation 
genes are therefore useful for modifying the transcription of other genes and hence complex 
phenotypes, as described above. Profiles of "protein degradation" genes are described in the 
Table below with associated biological activities. "Up-regulated 5 ' profiles are those where 
the gene produces mRNA levels that are higher in the 13B12-1 as compared to wild-type 
plant; and vice-versa for "down-regulated 55 profiles. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES OF 


BIOCHEMICAL 




TRANSCRIPTS 


MODIFYING GENE 


ACTIVITIES WHOSE 




ARE CHANGED 


PRODUCT LEVELS 


TRANSCRIPTS ARE 








CHANGED 


Up Regulated 


. Genes induced as 


Shoot formation 


• Transcription 


Transcripts 


a consequence of 


Lateral stem, lateral 


Activators and 




mutant 


and main 


Repressors 




ubiquitination 


inflorescence 


Chromatin Structure 




degradation 


development 


and/or Localized 




system 


• Internode 


DNA Topology 




Genes repressed 


elongation 


determining proteins 




by "protein 


• Node determination 


. Methylated DNA 




degradation" 


and development 


binding proteins 




system directly or 


• Root formation 


Kinases, 




indirectly 


Lateral root 


Phosphatases 






development 


Signal transduction 




Genes repressed 


Proper response to 


pathway proteins 




or mRNAs 


Auxin and other 


• Transporters 




degraded as a 


growth regulators 


• Metabolic Enzymes 




consequence of 


• Seed dormancy and 


Cell cycle 




mutant 


seed development 


checkpoint proteins 




ubiquitination 


• Resistance to 


Cell Membrane 




degradation 


drought and other 


Structure And 




process 


forms of stress 


Proteins 






• Secondary 


. Cell Wall Proteins 






metabolite 


Proteins involved in 






biosynthesis 


secondary 








metabolism 








• Seed storage 








metabolism 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 
WHOSE 
TRANSCRIPTS 
ARE CHANGED 


PHYSIOLOGICAL 
CONSEQUENCES OF 
MODIFYING GENE 
PRODUCT LEVELS 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES WHOSE 
TRANSCRIPTS ARE 
CHANGED 


Down Regulated 
Transcripts 


• Genes activated 
by "protein 
degradation" 
systems directly 
or indirectly 







"Protein degradation" genes and gene products can be modulated alone or in 
combination as described in the introduction. Of particular interest are combination of these 
genes and gene products with those that modulate hormone responses and/or metabolism. 
Hormone responsive and metabolism genes and gene products are described in more detail in 
the sections above. Such modification can lead to major changes in plant architecture and yield. 

Use Of Promoters And "Protein Degradation Genes, Gene Components And Products" 

Promoters of "protein degradation" genes, as described in the Reference tables, for 
example, can be used to modulate transcription of any polynucleotide, plant or non plant to 
achieve synthesis of a protein in association with production of the ubiquitination -proteasome 
pathway or the various cellular systems associated with it. Additionally such promoters can be 
used to synthesize antisense RNA copies of any gene to reduce the amount of protein product 
produced, or to synthesize RNA copies that reduce protein formation by RNA interference. 
Such modifications can make phenotypic changes and produce altered plants as described 
above. 
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m.G. CAROTENOGENESIS RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS 

Carotenoids serve important biochemical functions in both plants and animals. In 
plants, carotenoids function as accessory light harvesting pigments for photosynthesis and to 
protect chloroplasts and photosystem II from heat and oxidative damage by dissipating 
energy and scavenging oxygen radicals produced by high light intensities and other oxidative 
stresses. Decreases in yield frequently occur as a result of light stress and oxidative stress in 
the normal growth ranges of crop species. In addition light stress limits the geographic range 
of many crop species. Modest increases in oxidative stress tolerance would greatly improve 
the performance and growth range of many crop species. The development of genotypes with 
increased tolerance to light and oxidative stress would provide a more reliable means to 
minimize crop losses and diminish the use of energy-costly practices to modify the soil 
environment. 

In animals carotenoids such as beta-carotene are essential provitamins required for 
proper visual development and function. In addition, their antioxidative properties are also 
thought to provide valuable protection from diseases such as cancer. Modest increases in 
carotenoid levels in crop species could produce a dramatic effect on plant nutritional quality. 
The development of genotypes with increased carotenoid content would provide a more 
reliable and effective nutritional source of Vitamin A and other carotenoid derived 
antioxidants than through the use of costly nutritional supplements. 

Genetic changes produced through DNA mutation in a plant can result in the 
modulation of many genes and gene products. Examples of such mutation altered genes and 
gene products are shown in the Reference and Sequence Tables. These genes and/or products 
are responsible for effects on traits such as plant vigor, nutritional content and seed yield. 

While carotenoid synthesis and/or oxidative stress responsive polynucleotides and 
gene products can act alone, combinations of these polynucleotides also affect growth and 
development. Useful combinations include different carotenoid biosynthetic polynucleotides 
and/or gene products that have similar transcription profiles or similar biological activities, 
and members of the same or similar biochemical pathways. In addition, the combination of 
an carotenoid synthesis or oxidative stress protective polynucleotide and/or gene product with 
another environmentally responsive polynucleotide is also useful because of the interactions 
that exist between hormone regulated pathways, stress pathways, nutritional pathways and 
development. Here, in addition to polynucleotides having similar transcription profiles 
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and/or biological activities, useful combinations include polynucleotides that may have 
different transcription profiles but which participate in a common pathway. 

Such carotenoid synthesis/oxidative stress tolerance genes and gene products can 
function to either increase or dampen the above phenotypes or activities either in response to 
5 changes in light intensity or in the absence of osmotic fluctuations. They were discovered and 
characterized from a much larger set of genes by experiments designed to find genes whose 
mRNA products participate in carotenogenesis. These experiments made use of an 
Arabidopsis mutant (Or) having an accumulation of up to 500 times more beta-carotene than 
wild-type in non-photosynthetic tissues. 
10 Microarray technology allows monitoring of gene expression levels for thousands of 

genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The 
USArabidopsis Functional Genomics Consortium (AFGC) has recently made public the 
~£ results from such microarray experiments conducted with AFGC chips containing some 
45 10,000 non-redundant ESTs, selected from about 37,000 randomly sequenced ESTs 
n generated from mRNA of different tissues and developmental stages. 
\ * The sequences of the ESTs showing at least two-fold increases or decreases in the 

CP mutant plant compared with wild type seedlings were identified, compared to the Ceres full 

5 

£3 length cDNA and genomic sequence databanks, and equivalent Ceres clones identified. 

jfp MA_diff Table reports the results of this analysis, indicating those Ceres clones which are up 

- & 

41 or down regulated over controls, thereby indicating the Ceres clones which represent 
r[ Carotenoid synthesis/oxidative stress tolerance responsive genes. The MA_diff Table(s) 

reports the transcript levels of the experiment (see EXPT ID: Cauliflower (relating to SMD 

5329, SMD 5330)). For transcripts that had higher levels in the samples than the control, a 
25 "+" is shown. A "-" is shown for when transcript levels were reduced in root tips as 

compared to the control. For more experimental detail see the Example section below. 

Carotenogenesis genes are those sequences that showed differential expression as 

compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 

"-" indication 

30 
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Caroteno genesis Genes Identified By Cluster Analyses Of Differential 
Expression 

Carcinogenesis Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Carotenogenesis genes is any group in the MA_clust 
that comprises a cDNA ID that also appears in Expt ID Cauliflower (relating to SMD 5329, 
SMD 5330) of the MA_diff table(s). 

Carotenogenesis Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Carotenogenesis genes. A group in the 
MA_clust is considered a Carotenogenesis pathway or network if the group comprises a 
cDNA ID that also appears in Knock-in or Knock-out tables that causes one or more of the 
phenotypes described in section above. 

Carotenogenesis Genes Identified By Amino Acid Sequence Similarity 
Carotenogenesis genes from other plant species typically encode polypeptides 
that share amino acid similarity to the sequences encoded by corn and Arabidopsis 
Carotenogenesis genes. Groups of Carotenogenesis genes are identified in the Protein Group 
table. In this table, any protein group that comprises a peptide ID that corresponds to a 
cDNA ID member of a Carotenogenesis pathway or network is a group of proteins that also 
exhibits Carotenogenesis functions/utilities. 
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m.G. 1 . USE OF CAROTENOID SYNTHESIS7QXIDATIVE STRESS 

TOLERANCE RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 
Carotenoid synthesis/oxidative stress tolerance genes and gene products are useful to 
modulate one or more of the following phenotypes: 
Growth rate 

- Whole Plant, including Height, Flowering Time, etc. 
Seedling 

- Organ 

- Stem 

- Leaves 
Roots 

- Flowers 
Fruits 
Seeds 

- Yield 

- Size, Weight 
Seed Development 
Embryo 
Germination 

Cell Differentiation 

- Chloroplasts 

- Plant nutrition 

- Uptake and assimilation of organic compounds 
Uptake and assimilation of inorganic compounds 

- Animal (including human) nutrition 
Improved dietary mineral nutrition 
Stress Responses 

Drought 

- Cold 
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Osmotic 

To improve any of the phenotype(s) above, activities of one or more of the Carotenoid 
synthesis/oxidative stress tolerance genes or gene products can be modulated and tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(1998, Methods. Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. 
(1998) Plant Physiol 118: 743-50 and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Friedrich, (1999, JAMA 282: 1508), 
Kumar et al. (1999, Phytochemistry 51: 847-51), La Rocca et al. (2000, Physiologia 
Plantarum 109: 51-7) and Bartley (1994, In: Ann Rev Plant Physiol Plant Molec Biol, Jones 
and Somerville, eds, Annual Reviews Inc, Palo Alto, CA). 

m.G.2. USE OF CAROTENOID SYNTHESIS/OXIDATIVE STRESS 

TOLERANCE RESPONSIVE GENES. GENE COMPONENTS AND 
PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The activities of one or more of the carotenoid synthesis/oxidative stress tolerance 
genes can be modulated to change biochemical or metabolic activities and/or pathways such 
as those noted below. Such biological activities can be measured according to the citations 
included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , 
Differentiation 
and Development 


Chloroplast biosynthesis 


Kumar et al. (1999) Phytochemistry 
51: 847-51 

Fraser et al. (1994) Plant Physiol 
105: 405-13 


Metabolism 


Carotenoid biosynthesis 


Kumar et al. (1999) Phytochemistry 
51: 847-51 




Herbicide resistance 


La Rocca et al. (2000) Physiolgia 
Plantarum 109: 51-57 




Regulate abscisic acid levels 


Tan et al. (1997) PNAS USA 94: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 






12235-40 




Drought, cold and osmotic 
tolerance 


Tan et al. (1997) PNAS USA 94: 
12235-40 



Other biological activities that can be modulated by the Carotenoid synthesis, 
oxidative stress tolerance genes and gene products are listed in the Reference Tables. Assays 
for detecting such biological activities are described in the Protein Domain table. 

Profiles of these different carotenoid synthesis/oxidative stress tolerance responsive 
genes are shown in the Table below together with examples of the kinds of associated 
biological activities. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Genes induced during 


Gene 


• Transporters 


transcripts 


carotenoid synthesis/ 


Repression/Induction 


• Metabolic 




oxidative stress 


activity 


enzymes 




tolerance activity 


• Cell cycle progression 


• Kinases and 






• Chromatin 


phosphatases 






condensation 


• Transcription 






Synthesis of 


activators 






metabolites and/or 


Change in 






proteins 


chromatin 






• Modulation of 


structure and/or 






transduction pathways 


localized DNA 






• Specific gene 


topology 






transcription initiation 




Down-regulated 


Genes repressed 


• Gene 


Transcription 


transcripts 


during carotenoid 


repression/induction 


factors 




synthesis/oxidative 


activity 


Change in 




stress tolerance 




protein structure 




activity 


Changes in pathways 


by 






and processes 


phosphorylation 




Genes with 


operating in cells 


(kinases) or 




discontinued 




dephosphorylatio 




expression or 

XT 


Changes in 


n (phosphatases) 




unsTable mRNA in 


metabolism other than 


• Change in 




conditions of reduced 


carotenoid 


chromatin 




carotenoid 


synthesis/oxidative 


structure and/or 




synthesis/oxidative 


stress tolerance 


DNA topology 




stress tolerance 




. Stability of 








factors for 








protein synthesis 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 








and degradation 
• Metabolic 
enzymes 



Use of Promoters of Carotenogenesis Responsive Genes 

Promoters of Carotenogenesis responsive genes are useful for transcription of any 
desired polynucleotide or plant or non-plant origin. Further, any desired sequence can be 
transcribed in a similar temporal, tissue, or environmentally specific patterns as the 
Carotenogenesis responsive genes where the desired sequence is operably linked to a promoter 
of a Carotenogenesis responsive gene. The protein product of such a polynucleotide is usually 
synthesized in the same cells, in response to the same stimuli as the protein product of the gene 
from which the promoter was derived. Such promoter are also useful to produce antisense 
mRNAs to down-regulate the product of proteins, or to produce sense mRNAs to down-regulate 
mRNAs via sense suppression. 
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IV. VIABILITY GENES, GENE COMPONENTS AND PRODUCTS 

IV.A. VIABILITY GENES. GENE COMPONENTS AND PRODUCTS 
Plants contain many proteins and pathways that when blocked or induced lead to cell, 
organ or whole plant death. Gene variants that influence these pathways can have profound 
effects on plant survival, vigor and performance. The critical pathways include those 
concerned with metabolism and development or protection against stresses, diseases and 
pests. They also include those involved in apoptosis and necrosis. The applicants have 
elucidated many such genes and pathways by discovering genes that when inactivated lead to 
cell or plant death. 

Herbicides are, by definition, chemicals that cause death of tissues, organs and whole 
plants. The genes and pathways that are activated or inactivated by herbicides include those 
that cause cell death as well as those that function to provide protection. The applicants have 
elucidated these genes. 

The genes defined in this section have many uses including manipulating which cells, 
tissues and organs are selectively killed, which are protected, making plants resistant to 
herbicides, discovering new herbicides and making plants resistant to various stresses. 

IV.A. 1. IDENTIFICATION OF VIABILITY GENES, GENE COMPONENTS 
AND PRODUCTS 

Viability genes identified here are defined as genes, gene components and products 
capable of inhibiting cell, tissue, organ or whole plant death or protecting cells, organs and 
plants against death and toxic chemicals or stresses. Examples of such viability genes and 
gene products are shown in the Reference, Sequence, Protein Group, Protein Group Matrix 
tables, MA_diff, MA_clust, Knock-in and Knock-out tables. The biochemical functions of 
the protein products of many of these genes determined from comparisons with known 
proteins are also given in the Reference tables. 

Viability Genes, Gene Components And Products Identified By Phenotvpic 
Observations 

These genes were discovered and characterized from a much larger set of genes by 
experiments designed to find genes that cause serious disturbances in progeny survival, seed 
germination, development, embryo and/or seedling growth. In these experiments, viability 
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genes were identified by either (1) ectopic expression of a cDNA in a plant or (2) 
mutagenesis of a plant genome. The plants were then cultivated and one or more of the 
following phenotypes, which varied from the parental wild-type was observed: 

A. Gametophytic loss of progeny seedlings (detected from a parent on the 
basis of a linked herbicide resistance gene showing abnormal segregation 
ratios, as revealed by treating with herbicide) 

B. Embryo death, resulting in some cases to loss of seed 

C. Pigment variation in cotyledons and leaves, including absence of 
chlorophyll, which leads to seedling death. 

1 . Abinos 

2. Yellow/greens 

D. Cotyledons produced but no or few leaves and followed by seedling death. 

E. Very small plantlets 

The genes identified in these experiments are shown in Tables X. 

Viability Genes, Gene Components And Products Identified By Differential 
Expression 

Viability genes were also identified from a much larger set of genes by experiments 
designed to find genes whose mRNA products changed in concentration in response to 
applications of different herbicides to plants. Viability genes are characteristically 
differentially transcribed in response to fluctuating herbicide levels or concentrations, 
whether internal or external to an organism or cell. The MA_diff Table reports the changes 
in transcript levels of various viability genes in entire seedlings at 0, 4, 8, 12, 24, and 48 
hours after a plant was sprayed with a Hoagland's nutrient solution enriched with either 2,4 D 
(Trimec), Glean, Grassgetter, Roundup, or Finale herbicides as compared to seedlings 
sprayed with Hoagland's solution only. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
108467, 107871, 107876, 108468, 107881, 108465, 107896, 108466, 107886, 107891, 108501). 
For transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is 
shown for when transcript levels were reduced in root tips as compared to the control. For more 
experimental detail see the Example section below. 

Viability genes are those sequences that showed differential expression as compared to 
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controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Viability Genes Identified By Cluster Analyses Of Differential Expression 
Viability Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in the 
microarray experiments, but also have identified the genes that act in concert with them. The 
MA_clust table indicates groups of genes that have well correlated transcription profiles and 
therefore participate in the same pathway or network. 

A pathway or network of Viability genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108467, 107871, 107876, 108468, 107881, 108465, 
107896, 108466, 107886, 107891, 108501 of the MA_diff table(s). 

Viability Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Viability genes. A group in the MA_clust is 
considered a Viability pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Viability Genes Identified By Amino Acid Sequence Similarity 
Viability genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis Viability genes. Groups of 
Viability genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ED that corresponds to a cDNA ID member of a Viability pathway or 
network is a group of proteins that also exhibits Viability functions/utilities. 

It is assumed that those gene activity changes in response to the toxic herbicides are 
either responsible, directly or indirectly, for cell death or reflect activation of defense pathways. 
These genes are therefore useful for controlling plant viability. 

Examples of phenotypes, biochemical activities, or transcript profiles that can be 
modulated using selected viability gene components are described above and below. 
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IV.A.2. USE OF VIABILITY GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 

Deficiencies in viability genes can cause cell death at various rates and under various 
conditions. Viability genes can be divided into two classes; (1) those that lead to cell death 
under permissive growth conditions and (2) those that cause cell demise under restrictive 
conditions. Examples of the first class are viability genes which encode toxins or which 
participate in the programmed cell death pathway(s). Disruption of metabolic pathways, such 
as amino acid synthesis, may not cause death when the cell is supplemented with appropriate 
amino acids, but can cause death under more restrictive conditions. 

Some deficiencies in viability genes identified cause the organism as a whole to die, 
while other genes cause death only of a specific subset of cells or organs. For example, genes 
identified from embryo viability phenotypes can cause an entire organism to die. In contrast, 
genes characterized from gametophytic lethals may inhibit cell growth only in a select set of 
cells. In addition, some viability genes may not cause an immediate demise. A seedling 
lethal phenotype is one such example, where a seed germinates and produces cotyledons but 
the plant dies before producing any true leaves. Yellow-green pigment mutants provide yet 
another set of examples. In some cases, the plant produces a number of yellow-green leaves 
but dies before producing any seed, due in part, to the necessity to produce chlorophyll in 
functioning chloroplasts to fix CO2. 

Viability genes, in which mutational deficiencies lead to death, carry no duplicates in 
the haploid plant genome. They thus may be especially likely to promote viability and vigor 
when expressed more optimally in a plant, in specific tissues or throughout the plant. 

Proteins which lead to death when inactivated, and other proteins in the pathways in 
which they act, are potential targets for herbicides. In this kind of application, chemicals 
specifically capable of interacting with such proteins are discovered. Typically, this could be 
done by designing a gene involving the relevant viability gene, that also facilitates a rapid 
easily measured assay for the functioning of the protein product, and treating plants 
containing the new genes with the potential herbicides. Those chemicals specifically 
interfering with the protein activity can then easily be selected for further development. 
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Genes whose products interact directly with a herbicide can also be modified such 
that the herbicide no longer inactivates the protein. Such genes are useful for making 
herbicide resistant plants, valuable in agriculture. 

Many of the genes activated or inactivated by the herbicides define genes involved in 
the pathways that protect the plant against damage and stresses. These genes and gene 
components, especially those regulating such pathways, are especially useful for enhancing 
the ability of plants to withstand specific stresses, including herbicides. [See the sections on 
Stress responsive genes, gene components and products.] 

Genes that cause cellular death can be used to design new genes that cause death of 
specific cells and tissues and hence new valuable products. For example, activation of genes 
causing death in cells specifying seeds can be used to produce fruits lacking seeds. They can 
also be used to prevent cell death by pathogens and pests. 

The genes and gene components of the instant invention are useful to modulate one or 
more processes that affect viability and vigor at the (I) cellular level; (II) organelle level; (III) 
organ level; or (IV) overall organism level. 

Examples of phenotypes that are modulated are described above and below: 

I. CELLULAR LEVEL: 

Viability genes and gene products are useful in modulating cellular changes in: 

A. Cell size 

B. Cell differentiation 

C. Cell division 

D. Cell longevity 

E. Cell position 

F. Cytotoxins 

n. ORGANELLE LEVEL 

The development, growth, and viability of chloroplasts and/or mitochondria can be 
modulated by the genes and gene products of the instant invention: 

m. ORGAN LEVEL: 

The invention is also useful to regulate the development, growth, and viability of the 
following organs: 



Page 377 of 772 



Attorney Dkt. 2750-1483P 
Client Dkt. 80090.005 



A. Flower 1 . number 
2. size 

B. Seed 
1. Size 
Number 
Composition 



- Amino Acid, carbohydrates, lipid, and secondary metabolites 



C. Fruit 

1. Size 

2. Number 

3. Composition 

- Amino Acid, carbohydrates, lipid, and secondary metabolites 

1 . Fruit Drop 

2. Fruit Ripening 

D. Leaf 

1. Size 

2. Composition 

3. Amino acid, carbohydrates, lipid, and secondary metabolites 

4. Photoefficiency 

5. Abscission 

6. Senescence 

E. Stem 

F. Root 



IV. OVERALL ORGANISM LEVEL 

The following traits can be modulated with the genes, gene components and gene 
products of this invention to affect the strength of a plant as a whole: 
A. Vigor 



Increased biomass 



Stress Tolerance 



l.Cold 



2. Drought 



3. Heat 
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4. Herbicide 

5. Oxidative 

6. Salt 

C. Pathogen Resistance 

To regulate any of the phenotype(s) above, activities of one or more of the viability 
genes or gene products can be modulated in an organism and tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an example, 
a plant can be transformed according to Bechtold and Pelletier ( Methods. Mol. Biol. 82:259- 
266 (1998)) and/or screened for variants as in Winkler et al, Plant Physiol 118 : 743-50 and 
visually inspected for the desired phenotype or metabolically and/or functionally assayed. 

IV.A.3. USE OF VIABILITY GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
The viability genes, their components and/or products can be used to modulate 
processes, biochemical or metabolic activities and/or pathways such as those noted below. 
Such biological activities can be measured according to the citations included in the table 
below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Amino Acid Synthesis 


Aceto -lactate synthase 


Hershey et al. (1999) Plant 
Mol. Biol. 40, 795-806 


Cell Wall Synthesis 


Cellulose synthase 


Peng et al. (2001) Plant 
Physiol. 126, 981-982 
Kawagoe and Delmer 
(1997) Genet Eng. 19, 63-87 


Nucleotide Synthesis 


Coenzyme A biosynthesis 


Kupke et al. (2001) J. Biol. 
Chem. 276, 19190-19196 


Lipid Synthesis 


Oleosin biosynthesis 


Singh et al. (2000) 
Biochem. Soc. Trans. 28, 
925-927 
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PROCESS 

• 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 








Zou et al. (1996). Plant Mol. 








Biol. 31,429-433 




Hormone Signaling 


Brassinolide and light signal 


Kangetal. (2001) Cell 105, 




Pathways 


transduction 


625-636 




Hormone Biosynthesis 


Cytokinin biosynthesis 


Takei et al, (2001) J. Biol. 
Chem. 276, 26405-26410 


u 


Secondary Metabolites 


• Carotenoid biosynthesis 


Estevez et al. (2001) J. Biol. 
Chem. 276, 22901-22909 
Carol and Kuntz (2001) 
Trendy Plant Sci. 6, 31-36 
Pogson and Rissler (2001) 








Phil. Trans. Roy. Soc. Lord. 








B 355, 1395-1400 


If! 


Clearing of Toxic 


Ubiquitination 




5 


Substances 






i : 


Growth, Differentiation 


• Farnesylation 


Pei et al (1998) Science 282: 


0^ 

Sir 


And Development 




287-290; Cutler et al. (1996) 


?* 

1 ? 






Science 273: 1239 




• Nitrogen Metabolism 


Goupil et al (1998) J Exptl 
Botany 49:1855-62 




Water Conservation And 


♦ Stomatal Development And 


Allen et al. (1999) Plant 




Resistance To Drought 


Physiology 


Cell 11: 1785-1798 




And Other Related 




Li et al. 2000 Science 287: 




Stresses 


• Stress Response Pathways 

• Inhibition Of Ethylene 


300-303 

Burnett Et Al 2000. J. Exptl 
Botany 51: 197-205 
Raschke (1987) In: Stomatal 
Function Zeiger et al. Eds., 
253-279 

Bush And Pages (1998) 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 






Production Under Low Water 


Plant Mol. Biol. 37: 425-35 




Potential 


Spollen Et Al (2000) Plant 




. Proline And Other Osmolite 


Physiol. 122:967-976 




Synthesis And Degradation 








Hare et al. (1998) Plant, Cell 






And Environment 21:535- 






553; Hare et al. (1999) J. 






Exptl. Botany 50:413-434 


Programmed cell death 


• Proteases 


Kamens et al. (1995) J. Biol. 




• DNA endonucleases 


Chem. 270, 15250-15256 




• Mitochondriae uncoupling 


Wang etal. (2001) 




proteins 


Anticancer Res. 21, 1789- 






1794 






Drake et al. (1996) Plant 






Mol. Biol 304, 755-767 






Mittlerand Lam (1995) 






Plant Cell 7, 1951-1962 






Mittlerand Lam (1995) 






Plant Physiol. 108, 489-493 






Thelen and Northcote 






(1989) Plantal79, 181-195 






Hanak and Jezek (2001) 






FEBS Lett. 495, 137-141 




• Plasmalemma and Tonoplast 


Macrobbie (1998) Philos 




Ion Channel Changes 


Trans R Soc Lond B Biol 






Sci353: 1475-88; Li etal 






(2000) Science 287:300- 






303; Barkla etal. (1999) 




• Ca2+ Accumulation 


Plant Physiol. 120:811-819 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 








Lacombe et al. (2000) Plant 






Cell 12: 837-51; Wanget 






al. (1998) Plant Physiol 




. K+ Efflux 


118:1421-1429; Shi etal. 






(1999) Plant Cell 11:2393- 




• Activation Of Kinases And 


2406 




Phosphatases 


Gaymard et al. (1998) Cell 






94:647-655 






Jonak et al. (1996) Proc. 






Natl. Acad. Sci 93: 11274- 






79; Sheen (1998) Proc. Natl. 






Acad. Sci. 95: 975-80; Allen 






etal. (1999) Plant Cell 11: 






1785-98 



Other biological activities that can be modulated by the viability genes, their 
components and products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 

IV.A.4. USE OF VIABILITY GENES. GENE COMPONENTS AND 

PRODUCTS TO MODULATE TRANSCRIPT LEVELS OF OTHER 
GENES 

The expression of many genes is "up regulated" or "down regulated" following 
herbicide treatment and also in the leaf mutants, because some "viability" genes and their 
products are integrated into complex networks that regulate transcription of many other 
genes. Some "viability genes" are therefore useful for modifying the transcription of other 
genes and hence complex phenotypes, as described above. The data from differential 
expression experiments can be used to identify a number of types of transcript profiles of 
"viability genes", including "early responders," and "delayed responders", " early responder 
repressors" and " delayed repressors". Profiles of these different types responsive genes are 
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shown in the Table below together with examples of the kinds of associated biological 
activities. "Up-regulated" profiles are those where the mRNA transcript levels are higher in 
the herbicide treated plants as compared to the untreated plants. "Down-regulated" profiles 
represent higher transcript levels in the untreated plant as compared to the herbicide treated 
plants. 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 


Up Regulated 


• Early Responders 


. Suppression of 

rr 


• Transcription 


Transcripts 


To: 


cell, tissue, organ 


Factors 


(Level At 4 Hr = 0 


• Gluphosinate 


or plant death 


Transporters 


Hr) or 


. Chlorsulfuron 


following: 


• Change In Cell 


(Level At 4 Hr > 0 


• Glyphosate 


• Herbicide 


Membrane Structure 


Hr) 


and/or 2, 4-D 


treatment or 


• Kinases And 






under stress 


Phosphatases 






Activation of cell, 


• Germins, Germin- 






tissue, organ or 


like proteins, 






plant death 


Calcium-binding 






following: 


proteins and H2O2 






Herbicide 


generating and 






treatment or 


H2O2 neutralizing 






under stress 


proteins. 








Transcription 








Activators 








• Change In 








Chromatin Structure 








And/Or Localized 








DNA Topology 








Annexins, cell wall 








structural proteins 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 


Up Regulated 


Delayed Responders to 


• Suppression of 


• Transcription 


Transcripts 


Gluphosinate, 


cell, tissue, organ 


Factors 


(Level At 4 Hr < 12 


Chlorsulfuron, 


or plant death 


Specific Factors 


Hr) 


Glyphosate and/or 2, 4- 


following: 


(Initiation And 




D 


• Herbicide 


Elongation) For 






treatment or 


Protein Synthesis 






under stress 


• Lipid transfer 






• Activation of cell, 


proteins 






tissue, organ or 


• Myrosinase-binding 






plant death 


proteins 






following: 


• Sugar 






• Herbicide 


interconverting 






treatment or 


enzymes 






under stress 


Maintenance Of 








mRNA Stability 








• Maintenance Of 








Protein Stability 








• Maintenance Of 








Protein-Protein 








Interaction 








Protein 








translocation factors 








• RNA-binding 








proteins 








• Centromere and 








cytoskeleton 








proteins 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 








Lipases 








• Zn/Cu transporters 








Cell wall structural 








proteins 


Down-Regulated 


Early Responder 


• Suppression of 


• Transcription 


Transcripts 


Repressors Of Stress 


cell, tissue, organ 


Factors 


(Level At 0 Hr = 4 


Response State Of 


or plant death 


• Change In Protein 


Hr) or 


Metabolism 


following: 


Structure By 


(Level At 0 Hr > 4 




Herbicide 


Phosphorylation 


Hr) 


Genes With 


treatment or 


(Kinases) Or 




Discontinued 


under stress 


Dephosphoryaltion 




Expression Or UnsTable 


Activation of cell, 


(Phosphatases) 




mRNA In Presence Of 


tissue, organ or 


• Change In 




Herbicide or Abiotic 


plant death 


Chromatin Structure 




Stress 


following: 


Arid/Or DNA 






• Herbicide 


Topology 






treatment or 


• H2O2 neutralizing 






under stress 


proteins 






• Zn/Cu transporters 


• Neutralizing 






. Cell wall 


proteins including 






structural proteins 


SOD and GST 


Down-Regulated 


Delayed Responder 


• Suppression of 


• Transcription 


Transcripts 


Repressors Of ABA 


cell, tissue, organ 


Factors 


(Level At4Hr> 12 


Function State Of 


or plant death 


• Kinases And 


Hr) 


Metabolism 


following: 


Phosphatases 






• Herbicide 


. Stability Of Factors 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


WHOSE 


CONSEQUENCES 


BIOCHEMICAL 




TRANSCRIPTS ARE 


OF MODIFYING 


ACTIVITIES WHOSE 




CHANGED 


GENE PRODUCT 


TRANSCRIPTS ARE 






LEVELS 


CHANGED 




Genes With 


treatment or 


For Protein 




Discontinued 


under stress 


Synthesis And 






• Activation of cell, 


Degradation 






tissue, organ or 


• Amino Acid 






plant death 


biosynthesis 






following: 


proteins including 






• Herbicide 


aspargive synthase 






treatment or 




- 




under stress 






Expression Or Unstable 




• Ca-binding proteins 




mRNA In Presence Of 




Lipid biosynthesis 




herbicide or Abiotic 




proteins 




Stress 




• Lipases 








• Zn/Cu transporters 








Cell wall structural 








proteins 



While viability modulating polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. 



IV.A.5. USE OF PROMOTERS OF VIABILITY GENES, GENE 
COMPONENTS AND PRODUCTS 
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Promoters of viability genes can include those that are induced by (1) destructive 
chemicals, e.g. herbicides, (2) stress, or (3) death. These promoters can be linked operably to 
achieve expression of any polynucleotide from any organism. Specific promoters from 
viability genes can be selected to ensure transcription in the desired tissue or organ. Proteins 
expressed under the control of such promoters can include those that can induce or accelerate 
death or those that can protect plant cells organ death. For example, stress tolerance can be 
increased by using promoters of viability genes to drive transcription of cold tolerance 
proteins, for example. Alternatively, promoters induced by apoptosis can be utilized to drive 
transcription of antisense constructs that inhibit cell death. 



Page 387 of 772 



Attorney Ukt. 27 30-14831* 
Client Dkt. 80090.005 



IV.B. HISTONE DEACETYLASE (AXEL) RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS 

The deacetylation of histones is known to play an important role in regulating gene 
expression at the chromatin level in eukaryotic cells. Histone deacetylation is catalyzed by \ 
proteins known as histone deacetylases (HDAcs). HDAcs are found in multisubunit 
complexes that are recruited to specific sites on nuclear DNA thereby affecting chromatin 
architecture and target gene transcription. . Mutations in plant HDAc genes cause alterations in 
vegetative and reproductive growth that result from changes in the expression and activities 
of HDAc target genes or genes whose expression is governed by HDAc target genes. For 
example, transcription factor proteins control whole pathways or segments of pathways and 
proteins also control the activity of signal transduction pathways. Therefore, manipulation of 
these types of protein levels is especially useful for altering phenotypes and biochemical 
activities. 

Manipulation of one or more HDAc gene activities are useful to modulate the 
biological activities and/or phenotypes listed below. HDAc genes and gene products can act 
alone or in combination. Useful combinations include HDAc genes and/or gene products with 
similar biological activities, or members of the same, co-regulated or functionally related 
biochemical pathways. Such HDAc genes and gene products can function to either increase 
or dampen these phenotypes or activities. 

Examples of genes whose expression is affected by alterations in HDAc activity are 
shown in the Reference and Sequence Tables. These genes and/or gene products are 
responsible for effects on traits such as inflorescence branching and seed production. They 
were discovered and characterized from a much larger set of genes by experiments designed 
to find genes whose mRNA products are affected by a decrease in HDAc gene activity. These 
experiments made use of an Arabidopsis mutant having severely reduced mRNA levels for 
the histone deactylase gene AtHDACl. 



thousands of genes in a single experiment. This is achieved by simultaneously hybridizing 
two differentially labeled fluorescent cDNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing 10,000 non-redundant ESTs, selected from 37,000 



Microarray technology allows monitoring of gene expression levels for 
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randomly sequenced ESTs generated from mRNA of different tissues and developmental 
stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and identical Ceres clones identified. MA_diff table reports the results of this 
analysis, indicating those Ceres clones which are up or down regulated over controls, thereby 
indicating the Ceres clones which are HDAc genes. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: Axel (relating to SMD 6654, SMD 6655)). 
For transcripts that had higher levels in the samples than the control, a "+" is shown. A "-" is 
shown for when transcript levels were reduced in root tips as compared to the control. For 
more experimental detail see the Example section below. 

Histone Deacetylase genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 

Histone Deacetylase Genes Identified By Cluster Analyses Of Differential Expression 
Histone Deacetylase Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Histone Deacetylase genes is any group in the 
MA_clust that comprises a cDNA ID that also appears in Expt ID Axel (relating to SMD 
6654, SMD 6655) of the MA_diff table(s). 

Histone Deacetylase Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Histone Deacetylase genes. A group in 
the MA clust is considered a Histone Deacetylase pathway or network if the group comprises 
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a cDNA ID that also appears in Knock-in or Knock-out tables that causes one or more of the 
phenotypes described in section above. 



Histone Deacetylase Genes Identified By Amino Acid Sequence Similarity 
Histone Deacetylase genes from other plant species typically encode 
polypeptides that share amino acid similarity to the sequences encoded by corn and 
Arabidopsis Histone Deacetylase genes. Groups of Histone Deacetylase genes are identified 
in the Protein Group table. In this table, any protein group that comprises a peptide ID that 
corresponds to a cDNA ID member of a Histone Deacetylase pathway or network is a group 
of proteins that also exhibits Histone Deacetylase functions/utilities. 



IV.B.l.USE OF HP AC GENES, GENE COMPONENTS AND PRODUCTS 

TO MODULATE PHENOTYPES 
HDAc genes and gene products are useful to or modulate one or more of the 
following phenotypes: 
Growth Rate 

• Whole Plant, Including Height, Flowering Time, Etc. 

• Seedling 

• Organ 

. Stem 
Leaves 
Roots 
Flowers 

• Fruits 

• Seeds 
. Yield 

• Size, Weight 

• Seed Development 
Embryo 

• Germination 

• Cell Differentiation 
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To improve any of the phenotype(s) above, activities of one or more of the HDAc 
genes or gene products can be modulated and tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels can be altered in a plant utilizing the 
procedures described herein and the phenotypes can be assayed. As an example, a plant can 
be transformed according to Bechtold and Pelletier (1998, Methods. Mol. Biol. 82:259-266) 
and visually inspected for the desired phenotype or metabolically and/or functionally assayed 
according to Wu et al. (2000,, Plant J 22: 19-27), Hu et al. (2000, J Biol Chem 275: 15254- 
64), Johnson and Turner (1999, Semin Cell Dev Biol 10: 179-88), Koyama et al (2000, 
Blood 96: 1490-5), Wu et al. (2000, Plant J 22: 19-27), Li (1999, Nature Genetics 23: 5-6), 
Adams et al. (2000, Development 127: 2493-2502) and Lechner et al. (2000, Biochemistry 
39: 1683-92). 

IV.B.2. USE OF HDAC DEVELOPMENT GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 



The activities of one or more of the HDAc genes can be modulated to change 
biochemical or metabolic activities and/or pathways such as those noted below. Such 
biological activities can be measured according to the citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , Differentiation 
And Development 


Cell Differentiation 
Cell Cycle Progression 


Koyama et al. (2000) Blood 
96: 1490-5 

Hu et al. (2000) J Biol Chem 
275: 15254-64 


Metabolism 


. Chromatin Structure 

• Gene Transcription And 
Chromatin Assembly 


Hu et al. (2000) J Biol Chem 

275: 15254-64 

Johnson and Turner (1999) 

Semin Cell Dev Biol 10: 179- 

88 


Reproduction And Seed 
Development 


. Seed Development 


Wu et al. (2000) Plant J 22:19- 
27 
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BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING 




AND/OR PATHWAYS 


ASSAYS 




• Seed Germination 


Lechner et al. (2000) 




Independent Embryo 


Biochemistry 39: 1683-92 




Fertilization 


Ohad et al. (1996) PNAS USA 
93: 5319-24 




. Fertilization Independent 






Seed Development 


Chaudhury et al. (1997) PNAS 
USA 94: 4222-28 




. Megagametogenesis 


Christensen et al. (1997) Sex 
Plant Reproduc 10: 49-64 



Other biological activities that can be modulated by the HDAc genes and gene 
products are listed in the REFERENCE Table. Assays for detecting such biological activities 
are described in the Protein Domain table. 

Profiles of these different HDAc genes are shown in the Table below with examples 
of associated biological activities. 



Page 392 of 772 




Attorney UKt. 

Client Dkt. 80090.005 



TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up Regulated 


Responders To 


Gene Repression 


• Transporters 


Transcripts 


HDAc Activity 


Activity 


Metabolic enzymes 






. Cell Cycle 


Kinases and 






Progression 


phosphatases 






Chromatin 


Transcription 






Condensation 


activators 






• Synthesis Of 


Change in 






Metabolites 


chromatin structure 






And/Or Proteins 


and/or localized 








DNA topology 






• Modulation Of 








Transduction 








Pathways 








Specific Gene 








Transcription 








Initiation 




Down-Regulated 


Responder To Hdac 


• Negative 


• Transcription 


Transcripts 


Inhibitors 


Regulation Of 


factors 






Acetylation 


Change in protein 






Pathways 


structure by 




Genes With 




phosphorylation 




Discontinued 


Changes In 


(kinases) or 




Expression Or 


Pathways And 


dephosphorylation 




UnsTable Mrna In 


Processes 


(phosphatases) 




Presence Of Hdac 


Operating In Cells 


• Change in 






Changes In 


chromatin structure 






Metabolism 


and/or DNA 








topology 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 








. Stability of factors 
for protein synthesis 
and degradation 
Metabolic enzymes 



Use of Promoters of Histone Deacetylase Responsive Genes 

Promoters of Histone Deacetylase responsive genes are useful for transcription of any 
desired polynucleotide or plant or non-plant origin. Further, any desired sequence can be 
transcribed in a similar temporal, tissue, or environmentally specific patterns as the Histone 
Deacetylase responsive genes where the desired sequence is operably linked to a promoter of 
a Histone Deacetylase responsive gene. The protein product of such a polynucleotide is 
usually synthesized in the same cells, in response to the same stimuli as the protein product of 
the gene from which the promoter was derived. Such promoter are also useful to produce 
antisense mRNAs to down-regulate the product of proteins, or to produce sense mRNAs to 
down-regulate mRNAs via sense suppression. 
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V. STRESS RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

V.A. COLD RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

The ability to endure low temperatures and freezing is a major determinant of the 
geographical distribution and productivity of agricultural crops. Even in areas considered 
suiTable for the cultivation of a given species or cultivar, can give rise to yield decreases and 
crop failures as a result of aberrant, freezing temperatures. Even modest increases (1-2°C) in the 
freezing tolerance of certain crop species would have a dramatic impact on agricultural 
productivity in some areas. The development of genotypes with increased freezing tolerance 
would provide a more reliable means to minimize crop losses and diminish the use of energy- 
costly practices to modify the microclimate. 

Sudden cold temperatures result in modulation of many genes and gene products, 
including promoters. Examples of such cold responsive genes and gene products are shown in 
the Reference, Sequence, Protein Group, Protein Group Matrix tables, MA_diff and MA_clust 
tables These genes and/or products are responsible for effects on traits such as plant vigor and 
seed yield. They were discovered and characterized from a much larger set by experiments 
designed to find genes whose mRNA products changed in response to cold treatment. 

Manipulation of one or more cold responsive gene activities are useful to modulate 
the biological activities and/or phenotypes listed below. Cold responsive genes and gene 
products can act alone or in combination. Useful combinations include cold responsive genes 
and/or gene products with similar transcription profiles, similar biological activities, or 
members of the same or functionally related biochemical pathways. Whole pathways or 
segments of pathways are controlled by transcription factor proteins and proteins controlling 
the activity of signal transduction pathways. Therefore, manipulation of the levels of such 
proteins is especially useful for altering phenotypes and biochemical activities of plants. The 
MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 108578, 
108579, 108533, 108534 ). For transcripts that had higher levels in the samples than the 
control, a "+" is shown. A "-" is shown for when transcript levels were reduced in root tips 
as compared to the control. For more experimental detail see the Example section below. 

Cold genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 
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Cold Genes Identified By Cluster Analyses Of Differential Expression 

Cold Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
5 well correlated. Applicants not only have identified the genes that are differentially 

expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Cold genes is any group in the MA_clust that 
10 comprises a cDNA ID that also appears in Expt ID 108578, 108579, 108533, 108534 of 
theMA_difftable(s). 

Cold Genes Identified By Correlation To Genes That Cause Physiological 
Q Consequences 

J5 Additionally, the differential expression data and the phenotypic observations 

J can be merged to identify pathways or networks of Cold genes. A group in the MA_clust is 
W considered a Cold pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
L above. 

5o 

m 

Cold Genes Identified By Amino Acid Sequence Similarity 
Q Cold genes from other plant species typically encode polypeptides that share 

amino acid similarity to the sequences encoded by corn and Arabidopsis Cold genes. Groups 

of Cold genes are identified in the Protein Group table. In this table, any protein group that 
25 comprises a peptide ID that corresponds to a cDNA ID member of a Cold pathway or 

network is a group of proteins that also exhibits Cold functions/utilities. 

Such cold responsive genes and their products can function to either increase or 

dampen the phenotypes and activities below either in response to cold treatment or in the 

absence of cold temperature fluctuations. 
30 Further, promoters of cold responsive genes, as described in the Reference tables, for 

example, are useful to modulate transcription that is induced by ABA or any of the following 

phenotypes or biological activities below. 
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V.A.I. USE OF COLD-RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Cold responsive genes and gene products are useful to or modulate one or 
more of the following phenotypes: 

- Cold Tolerance, below 7°C, for example 

- Cells 
Organelles 

- Proteins 
Dehydration Resistance 
Growth rate 

- Whole Plant, including height, bolting time, etc. 
Organs 

- Roots 

- Lateral Roots 

- Leaves 
Stems 

- Flowers 

- Fruit 
Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation; 

- Number, Size, and/or Weight 

- Flowers; 
Seeds; 
Branches; 

- Leaves; 

- Seed Yield 

- Number, Size, Weight, Harvest Index, Water Content 

- Fruit Yield 

- Number, Size, Weight, Harvest Index, Water Content 

To regulate any of the phenotype(s) above, activities of one or more of the 
cold responsive genes or gene products can be modulated and the plants can be tested 
by screening for the desired trait. Specifically, the gene, mRNA levels, or protein 
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levels can be altered in a plant utilizing the procedures described herein and the 
phenotypes can be screened for variants as in Winkler et al. (1998) Plant Physiol 118: 
743-50 and assayed, for example, in accordance to Steponokus et al. (1993) 
Biochimica et Biophysica Acta 1 145: 93-104; Quinn (1988) Symp Soc. Exp. Biol. 42: 
237-258; Bectold and Pelletier (1998) Methods Mol. Biol. 82: 259-266; Kasuga et al. 
(1999) Nature Biotechnology 17: 287-291; Guy et al. (1998) Cryobiology 36: 301- 
314; or Liu et al. (1998) Plant Cell 10: 1391-1406. 

V.A.2. USE OF COLD-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the cold responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities are documented and can be measured according to 
the citations above and those included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING ASSAYS 


Cold Tolerance 


Viability Of Plant 
Protoplasts At Low 
Temperatures. 


Steponkus (1998) PNAS USA 95: 
14570-14575 




Viability Of Yeast At Low 
Temperatures. 


Schirmer et al. (1994) Plant Cell 6: 
1899-1909 




Complementation Of Yeast 
Tsp Mutant 


Zentella et al. (1999) Plant 
Physiology, 119: 1473-1482 




Viability Of E.Coli At Low 
Temperatures. 


Yeh et.al. (1997) PNAS 94: 10967- 
10972 




Induction Of Cold Shock 
Response Genes 


Pearce (1999) Plant Growth 
Regulation 29: 47-76. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING ASSAYS 


Lipid Composition 


Altered Composition Of 
Membrane Fatty Acids 


Sayanova et al. (1999) Journal of 
Experimental Botany 50: 1647-1652 
Sayanova (1997) PNAS 
USA 94:4211-4216 




Alteration Of Lipoxygenase 
Enzyme Accumulation And 
Activity 


Porta et al. (1999) Plant and Cell 
Physiology 40: 850-858. 


Protein 
Composition 


- Protein Denaturation 

- Protein Hydrophilicity 


Wisniewski et al.(1999) Physiologia 
Plantarum 105: 600-608 
Steponkus (1998) PNAS USA 95: 
14570-14575 


Modulation of 
Transcription 
Induced by Low 
Temperatures 


- Induced Transcription 
Factors And Other Dna 
Binding Proteins 

Transcription Of 
Specific Genes 


Current Protocols in Molecular 
Biology / edited by Frederick M. 
Ausubel .. [et al.]. New York : 
Published by Greene Pub. Associates 
and Wiley-Interscience : J. Wiley, 
cl987. 

Steponkus (1998) PNAS USA 95: 
14570-14575 

Kadyrzhanova et al, Plant Mol Biol 
(1998) 36(6): 885-895; and 
Pearce et al., Plant Physiol (1998) 
117(3): 787-795 


Signal 

Transduction 


Plasma Membrane 
Proteins 


Goodwin et al., Plant Mol Biol (1996) 
31(4) 777-781; and 
Koike et al, Plant Cell Physiol (1997) 
38(6): 707-716 



Page 399 of 772 



Attorney Dkt. 2 

Client Dkt. 80090.005 





PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING ASSAYS 






METABOLIC ACTIVITIES 








AND/OR PATHWAYS 






Oxygen 


- Glutathione 


Kocsy et al., Planta (2000) 210(2): 




Scavengers 




295-301 






- Accumulation Active O2 


Tao et al., Cryobiology (1998) 






and H2O2 Scavengers 


37(l):38-45 




Dehydration 


- Dehydrin 


Ismail et al., Plant Physiol (1999) 








120(l):237-244 






- Transcription of mRNA 


Kaye et al., Plant Physiol (1998) 








116(4): 1367-1377 




Metabolism 


Soluble Sugars and/or' 


Wanner et al., (1999) Plant Physiol 






Proline 


120(2): 391-400 




RNA/DNA 


Stabilization of 


Jiang, Weining et al.,(1997) Journal of 




Chaperone 


RNA/DNA through 


Biological Chemistry, 272: 196-202. 


FIJ 
; zz: 

y i 




RNA binding and 


Fukunaga et al., (1999) Journal of 






modulation of RNA 


Plant Research, 112:263-272. 


5 




translation through RNA 








binding and or 




t : 




unwinding. 






Protein Chaperone 


Stabilize protein 


Forreiter and Nover (1998) Journal of 






structure and facilitate 


Biosciences 23: 287-302 






protein folding 





Other biological activities that can be modulated by the cold responsive genes and 
their products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 
5 Cold responsive genes are characteristically differentially expressed in response to fluctuating 
cold temperature levels, whether internal or external to an organism or cell. The MA_diff 
table reports the changes in transcript levels of various cold responsive genes in the aerial 
parts of seedlings at 1 and 6 hours at 4°C in the dark as compared to aerial parts of seedlings 
covered with aluminium foil, and grown at 20°C in the growth chamber. 
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The data from this time course can be used to identify a number of types of cold 
responsive genes and gene products, including "early responders" and "delayed responders". 
Profiles of these different cold responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITIES OF GENE 








PRODUCTS 


Upregulated Genes 


Early 


- Perception Of 


- Transcription Factors 


(Level At 1 h = 6 h) 


Responders To 


Cold 


- Kinases And 


or 


Cold 


- Induction Of 


Phosphatases 


(Level At 1 h > 6 h) 




Cold Response 


- Amino Acid Sugar And 






Signal 


Metabolite Transporters 






Transduction 


- Carbohydrate Catabolic 






Pathway s 


And Anabolic Enzymes. 






- Initiating 


- Lipid Biosynthesis 






Specific Gene 


Enzymes 






Transcription 


- Lipid Modification 






- Osmotic 


Enzymes, Example 






Adjustment 


Desaturases 






- Alteration Of 


- Ice Crystal Binding 






Lipid 


Proteins 






Composition. 


- Hydrophilic Proteins 






- Ice Nucleation 








Inhibition 








- Mitigation Of 








Dehydration By 








Sequestering 








Water 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITIES OF GENE 








PRODUCTS 




Stress Response 


- Repression Of 


-Transcription Factors 






General 


- Kinases And 






Biochemical 


Phosphatases 






Pathways To 


- Protein Stability Factors 






Optimize Cold 


- mRNA Stability Factors 






Response 


- mRNA Translation 






Pathways. 


Factors 






-Stabilization Of 


- Protein Turnover Factors 






Protein /Enzyme 


Oxygen Radical 






Activity At Low 


Scavengers, Example- 






Temperature 


Peroxidases 






-Protection 


- Energy Generation 






Against Oxidative 


Enzymes EtOH 






Stress 


Detoxification 






-Anaerobic 








Metabolism 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITIES OF GENE 








PRODUCTS 


Upregulated Genes 


Delayed 


- Respiration, 


-Transcription Factors 


(Level At lh<6h) 


Responders To 


Photosynthesis 


- Kinases And 




Cold Stress 


And Protein 


Phosphatases 




-Cold 


Synthesis 


- Protein Stability Factors 




Acclimation 


- Carbohydrate 


- mRNA Stability Factors 




Genes 


And Amino Acid 


- mRNA Translation 






Solute 


Factors 






Accumulation 


- Protein Turnover Factors 






- Increased Fatty 


- Oxygen Radical 






Acid Desaturation 


Scavengers, Peroxidase 






To Increase Lipid 


- Metabolic Enzymes 






Membrane 








Stability 








- Increased 








Accumulation Or 








Activity Of 








Oxidative Stress 








Protection Proteins 








- Stabilization Of 








Protein /Enzyme 








Activity At Low 








Temperature 








- Protection 








Against Oxidative 








Stress 








- Extracellular 








Matrix 








Modification 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITIES OF GENE 








PRODUCTS 




Stress Response 


- Stabilization Of 


- Transcription Factors 




Genes 


Protein /Enzyme 


- Kinases And 






Activity At Low 


Phosphatases 






Temperature 


- Protein Stability Factors 






- Protection 


- mRNA Stability Factors 






Against Oxidative 


- mRNA Translation 






Stress 


Factors 






- Anaerobic 


- Protein Turnover Factors 






Metabolism 


- Oxygen Radical 








Scavengers, Example- 








Peroxidase 








- Energy Generation 








Enzymes, Etoh 








Detoxification 



p 

m 

: sr 
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GENE 




FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 




CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 




GENE 


ACTIVITY 


ACTIVITIES OF GENE 










PRODUCTS 


Downregulated 




- Early 


- Negative 


- Transcription Factors 


(Level At 1 h = 


6h) 


Responder 


Regulation Of 


- Kinases And 


(Level At 6 h > 


lh) 


Repressors Of 


Cold Signal 


Phosphatases 






Cold Stress 


Transduction 


- Protein Stability Factors 






Metabolism 


Pathways Released 


- mRNA Stability Factors 










- mRNA Translation 










Factors 










- Protein Turnover Factors 






- Genes With 


- Negative 


- Cold Repressed 






Discontinued 


Regulation Of 


Metabolic Pathway 






Expression Or 


Cold Induced 


Proteins 






UnsTable 


Transcription 


- Factors Coordinating And 






mRNA In Cold 


Reduced 


Controlling Central C and 








- Reduction In 


N Metabolism 








Gene Expression 


- Storage Proteins 








In Pathways Not 










Required Under 










Cold Conditions 










- Induced mRNA 










Turnover 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITIES OF GENE 








PRODUCTS 


Down-Regulated 


- Delayed 


- Maintenance Of 


Transcription Factors 


Transcripts 


Responder 


Cold Induced State 


- Kinases And 


(Level At 1 h > 6 h) 


Repressors Of 


Of Metabolism 


Phosphatases 




Cold Stress 


- Reduction In 


- Protein Stability Factors 




Metabolism 


Gene Expression 


- mRNA Stability Factors 




- Genes With 


For Pathways Not 


- mRNA Translation 




Discontinued 


Required Under 


Factors 




Expression Or 


Cold Conditions 


- Protein Turnover Factors 




UnsTable 


- Induced mRNA 






mRNA In Cold 


Turnover 


- Cold Repressed 








Metabolic Pathway 








Proteins 








- Factors Coordinating And 








Controlling Central C and 








N Metabolism 








- Storage Proteins 



Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the cold responsive genes when the desired sequence is 
operably linked to a promoter of a cold responsive gene. 
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V.B. HEAT RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

The ability to endure high temperatures is a major determinant of the geographical 
distribution and productivity of agricultural crops. Decreases in yield and crop failure 
frequently occur as a result of aberrant, hot conditions even in areas considered suiTable for 
the cultivation of a given species or cultivar. Only modest increases in the heat tolerance of 
crop species would have a dramatic impact on agricultural productivity. The development of 
genotypes with increased heat tolerance would provide a more reliable means to minimize 
crop losses and diminish the use of energy-costly practices to modify the microclimate. 

Changes in temperature in the surrounding environment or in a plant microclimate 
results in modulation of many genes and gene products. Examples of such heat stress responsive 
genes and gene products are shown in the Reference, Sequence, Protein Group, Protein Group 
Matrix, MA_diff and MA_clust tables. These genes and/or products are responsible for effects 
on traits such as plant vigor and seed yield. They were discovered and characterized from a 
much larger set by experiments designed to find genes whose mRNA products changed in 
response to high temperatures. 

While heat stress responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different heat stress responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the 
same or similar biochemical pathways. Whole pathways or segments of pathways are 
controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. In addition, the combination of a 
heat stress responsive polynucleotide and/or gene product with other environmentally 
responsive polynucleotide is also useful because of the interactions that exist between stress 
pathways, pathogen stimulated pathways, hormone regulated pathways, nutritional pathways 
and development. Here, in addition to polynucleotides having similar transcription profiles 
and/or biological activities, useful combinations include polynucleotides that may have 
different transcription profiles, but which participate in common or overlapping pathways. 
The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 108576, 
108577, 108522, 108523). For transcripts that had higher levels in the samples than the 
control, a "+" is shown. A "-" is shown for when transcript levels were reduced in root tips 
as compared to the control For more experimental detail see the Example section below. 
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Heat genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 



Heat Genes Identified By Cluster Analyses Of Differential Expression 

Heat Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Heat genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108576, 108577, 108522, 108523 of the 
MA_difftable(s). 

Heat Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Heat genes. A group in the MA_clust is 
considered a Heat pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 



Heat Genes Identified By Amino Acid Sequence Similarity 

Heat genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Heat genes. Groups 
of Heat genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ED that corresponds to a cDNA ID member of a Heat pathway or 
network is a group of proteins that also exhibits Heat functions/utilities. 

Such heat stress responsive genes and gene products can function either to increase or 
dampen the above phenotypes or activities either in response to changes in temperature or in 
the absence of temperature fluctuations. 

Further, promoters of heat responsive genes, as described in the Reference tables, for 
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example, are useful to modulate transcription that is induced by heat or any of the following 
phenotypes or biological activities below. 

V.B.I. USE OF HEAT STRESS RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Heat stress responsive genes and gene products can be used to alter or modulate one 
or more of the following phenotypes: 

- Heat Tolerance, above 20°C, 23°C, 27°C, 30°C, 33°C, 37°C, 40°C or 42°C 

- Of Cells 

Of Organelles 

- Of Proteins 
Dehydration Resistance 

- Of Cells 

Of Organelles 
Growth Rate 

- Whole Plant, including height, bolting time, etc. 
Organs 

- Roots 
Lateral Roots 

- Leaves 
Stems 

- Flowers 
Fruit 

- - Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation; 

- Number, Size, and Weight of 

- Flowers; 
Seeds; 

- Branches; 
Leaves; 

- Seed Yield 

Number, Size, Weight, Harvest Index 

- Fruit Yield 
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- Number, Size, Weight, Harvest Index 
Stress Responses 

Mediation of response to desiccation, drought, salt, disease, wounding, 
cold and other stresses. 
Reproduction 

To regulate any of the phenotype(s) above, activity of one or more of the heat stress 
responsive genes or gene products can be modulated and the plants tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to Queitsch et al. (2000, The Plant Cell 12: 479-92). 



V.B.2. USE OF HEAT STRESS RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the heat stress responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAY 


Cell Growth and 


-Regulation And 


Wisniewski et al. 


Differentiation 


Molecular Chaperones 


(1999) Physiolgia 






PI ant arum 105: 




-Maintenance Of Native 


600-608 




Conformation (Cytosolic 


Queitsch et al. 




Proteins) 


(2000) The Plant 






Cell 12: 479-92 




-Reactivation Of 






Aggregation And Protein 


Lee and Vierling 




Folding 


(2000) Plant 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAY 






Physiol. 122: 189- 




-Autoregulation Of Heat 


197 




Shock Response 


Schwechheimer 






(1998) Plant Mol 






Biol 36: 195-204 






Shi et al. (1998) 






Genes and 




-Regulation Of 


Development 12: 




Translational Efficiency 


654-66 






Wells et al. (1998) 




-Regulation Of Kinase 


Genes and 




Activity 


Development 12: 




-Regulation Of Calcium 


3236-51 




Mediated Signal 


Lis et al. (2000) 




Transduction 


Genes and 






Development 14: 






792-803 






-Malho, R.(1999) Plant 






Biology 1: 487-494. 






-Sheen, Jen.( 1996) 






Science 274: 1900- 






1902. 






- Farmer, P. et al., (1999.) 






Biochimica et Biophysica 






Acta 1434: 6-17. 


Gene regulation 


• Transcriptional Regulation Of 
Heat Induced Proteins 
Through DNA Binding 


-Current Protocols in 
Molecular Biology / edited 
by Frederick M. Ausubel .. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAY 




Proteins. 


[et al.]. New York : 






Published by Greene Pub. 






Associates and Wiley- 






Interscience : J. Wiley, 






C1987. 






-Steponkus(1998) 






PNAS USA 95: 






14570-14575 






- Gubler et al. 






(1999) Plant 






Journal 17: 1-9 




. Transcriptional Regulation Of 






Heat Induced Proteins 


- Glenn et al. 




Through Protein-Protein 


(1999) Journal of 




Interactions Between DNA 


Biological 




Binding Proteins And 


Chemistry, 274: 




Coactivators. 


36159-36167 




• Transcriptional Regulation Of 






Heat Induced Proteins 


-Zhou et al., (1997) 




Through Protein 


EMBO Journal 16:3207- 




Phosphorylation And 


3218. 




Dephosphorylation 


- Sessa et al., 






(2000) EMBO Journal 19: 






2257-2269. 






- Burnett et 






al.,(2000) Journal 






of Experimental 






Botany. 51: 197- 




• Transcriptional Regulation Of 


205. 
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PROCESS 


BIOCHEMICAL OR 


CITATION INCLUDING 




METABOLIC ACTIVITIES 


ASSAY 




AND/OR PATHWAYS 






Thermal Stress Induced 






Genes By Protein-Protein 


- Osterlund et 




Interactions. 


al.,(2000) Nature 






405: 462-466. 




• Translational Regulation Of 






Thermal Stress Induced 






Messenger Rnas. 








- Gross and 




• Transcriptional Regulation Of 


Watson (1998) 




Heat Induced Genes Through 


Canadian Journal of 




Chromatin Remodeling. 


Microbiology, 






44:341-350 






- Luo, R. X., Dean, D.C. 






(1999) . 






Journal of the 






National Cancer 






Institute 91: 1288- 






1294. 






-Chromatin protocols 






(1999) edited by Peter B. 






Becker. Totowa, N.J. : 






Humana Press. 


Cell Structure 


• Thermal Stress Protection By 


- Goodwin et al. (1996) 




Plasma Membrane Anchored 


Plant Mol Biol 31(4) 777- 




Or Secreted And/Or Cell 


781; and 




Wall Associated Proteins. 


Koike et al. (1997) Plant 






Cell Physiol 38(6): 707- 






716 
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PROCESS 


BIOCHEMICAL OR 


CITATION INCLUDING 




METABOLIC ACTIVITIES 


ASSAY 




AND/OR PATHWAYS 




Signal Transduction 


• Regulation Of Thermal Stress 


- Jonak (1996) Proceedings 




Pathways And Protein 


of the National Academy of 




Activity By Protein Kinase 


Sciences of the United 




And Protein Phosphatase 


States of America, 93: 




Mediated Phosphorylation 


11274-11279. 




And Dephosphorylation 


-Monroy.et al., (1998) 




Respectively. 


Analytical Biochemistry 
265: 183-185. 


Photosynthesis 


Regulation Of 


Schroda et al. (1999) The 




Photoprotection And Repair 


Plant Cell 11: 1165-178 




Of Photosystem II 


Oh and Lee (1996) J Plant 
Biol. 39: 301-07 


Stress Response 


• Regulation Of Cytosol 


Dat et al. (1998) Plant 




Peroxide Levels 


Physiol 116: 1351-1357 




• Regulation Of Heat Shock 
Factor Binding 


Kurek et al. (1999) Plant 
Physiol 119: 693-703 

Storozhenko et al. (1998) 
Plant Physiol 118: 1005-14 
Soto etal. (1999) Plant 
Physiol 120: 521-28 




• Regulation Of Protein 


Yeh et al. (1997) PNAS 94: 




Stability During Thermal 


10967-10972 




Stress 


Winkler et al. (1998) Plant 
Physiol 118: 743-50 
Saavedra et al. (1997) 




• Nucleocytoplasmic Export Of 


Genes and Development 




Heat Shock Protein Mrnas 


11: 2845-2856 
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PROCESS 


BIOCHEMICAL OR 


CITATION INCLUDING 




METABOLIC ACTIVITIES 


ASSAY 




AND/OR PATHWAYS 








Parsell and Lindquist 




• Regulation/Reconfiguration 


(1993). Ann. Rev. Genet. 




Of Cell Architecture 


27: 437-496. 






Parsell and Lindquist 




• Regulation Of Pathways For 


(1993). Ann. Rev. Genet. 




Reactivation Of "Damaged" 


27: 437-496. 




And/Or Denatured Proteins 


Georgopoulos and Welch 






(1993). Ann Rev. Cell Biol. 






9:601-634. 






- Vierstra, Richard D. 




Regulation Of Protein 


(1996) Plant Molecular 




Degradation During Thermal 


Biology,32:275-302. 




Stress. 


- Vierstra, Richard D.; 






Callis, Judy. (1999) Plant 






Molecular Biology, 






41:435-442. 






-Liu, J. et al., (1998)Plant 






Science 134:11-20. 




• Regulation Of Osmotic 


- Freestone, P. 1997et al., 




Potential During Thermal 


Journal of Molecular 




Stress. 


Biology, v. 274:318-324. 






- Robertson,A.J. (1994) 




• Regulation Of Universal 


Plant Physiology 105: 181- 




Stress Protein Homologue 


190. 




Activity By Phosphorylation 






And Dephosphorylation. 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAY 




. Regulation Of Dehydrin, 
LEA-Like And Other Heat 
STable Protein Accumulation 





Other biological activities that can be modulated by the heat stress responsive genes 
and gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Heat stress responsive genes are characteristically differentially transcribed in 
response to fluctuating temperatures, whether internal or external to an organism or cell. The 
MA_diff table reports the changes in transcript levels of various heat stress responsive genes 
in aerial tissues at 1 and 6 hours after plants were placed at 42°C as compared to aerial tissues 
kept at 20°C growth chamber temperature. 

The data from this time course can be used to identify a number of types of heat stress 
responsive genes and gene products, including "early responders to heat stress," "delayed 
responders to heat stress," "early responder repressors," and "delayed repressor responders." 
Profiles of these different heat stress responsive genes are shown in the Table below together 
with examples of the kinds of associated biological activities. 



GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY OF 
GENE 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITIES / GENE 
PRODUCTS 


Up Regulated 
Transcripts 
(Level At lh«6h) 
Or 

(Level At lh>6h) 


Early Responders 
To Heat Stress 


. Heat Stress Perception 
• Modulation Of Heat 

Stress Response 

Transduction 

Pathways 


. Transcription 
Factors 

• Transporters 

• Changes In Cell 
Membrane Structure 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES / GENE 








PRODUCTS 






• Specific Gene 


Kinases And 






Transcription 


Phosphatases 






Initiation 










• Transcription 






• Conditional Shift In 


Activators 






Preferential 


Changes In 






Translation Of 


Chromatin Structure 






Transcripts 


And/Or Localized 








Dna Topology 








• Modification Of Pre- 






♦ Changes In Cell 


Existing Translation 






Architecture To 


Factors By 






Optimize Cell 


Phosphorylation 






Adaptation To Heat 


(Kinases) Or 






Stress 


Dephosphorylation 








(Phosphatases) 








• Synthesis Of New 








Translation Factors 








. Stability Of 








Mediators Of 








Protein-Protein 








Interaction 








Heat Shock Proteins 








Changes In 








Organelle 








Structures, 








Membranes And 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES / GENE 








PRODUCTS 








Energy-Related 








Activities 








Proteins To Catalyse 








Metabolic Turnover 


Up Regulated 


"Delayed" 




• Transcription 


Transcripts 


Responders 




Factors 


(Level At lh<6h) 




• Maintenance Of 


• Specific Factors 




Maintenance Of 


Response To Heat 


(Initiation And 




Heat Stress 


Stress 


Elongation) For 




Response 




Protein Synthesis 








Maintenance Of 






Maintenance Of 


Mrna Stability 






Protein Stability And 


. Heat Shock Proteins 






Conformation 


♦ Changes In 








Organelle 








Structures, 








Membranes And 








Energy-Related 








Activities 








• Proteins To Catalyse 








Metabolic Turnover. 








. Stability Of 








Mediators Of 








Protein-Protein 








Interaction 


Down-Regulated 


Early Responder 


• Negative Regulation 


• Transcription 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES / GENE 








PRODUCTS 


Transcripts 


Repressors Of 


Of Heat Stress 


Factors And 


(Level At lh«6h) 


"Normal" State Of 


Response Released 


Activators 


Or 


Metabolism 


Changes In 


Change In Protein 


(Level At 6h> lh) 




Biochemical And 


Structure By 




Genes With 


Signal Transduction 


Phosphorylation 




Discontinued 


Pathways And 


(Kinases) Or 




Expression Or 


Processes Operating In 


Dephosphoryaltion 




UnsTable mRNA 


Cells 


(Phosphatases) 




In Presence Of 


• Reorientation Of 


• Change In 




Heat Stress 


Metabolism 


Chromatin Structure 








And/Or Dna 








Topology 


Down-Regulated 


Delayed 


Maintenance Of Heat 


Transcription 


Transcripts 


Repressors Of 


Stress Response 


Factors And 


(Level At lhr> 


"Normal" State Of 


• Maintenance Of 


Activators 


6hr) 


Metabolism 


Pathways Released 


• Kinases And 






From Repression 


Phosphatases 




Genes With 


Changes In Pathways 


. Stability Of Factors 




Discontinued 


And Processes . 


For Protein 




Expression Or 


Operating In Cells 


Translation 




UnsTable mRNA 


• Reorientation Of 






In Presence Of 


Metabolism 






Heat Stress 







Further, any desired sequence can be transcribed in similar temporal, tissue, or 
environmentally specific patterns as the heat responsive genes when the desired sequence is 
operably linked to a promoter of a heat responsive gene. 

5 
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V.C. DROUGHT RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

The ability to endure drought conditions is a major determinant of the geographical 
distribution and productivity of agricultural crops. Decreases in yield and crop failure 
frequently occur as a result of aberrant, drought conditions even in areas considered suiTable 
for the cultivation of a given species or cultivar. Only modest increases in the drought 
tolerance of crop species would have a dramatic impact on agricultural productivity. The 
development of genotypes with increased drought tolerance would provide a more reliable 
means to minimize crop losses and diminish the use of energy-costly practices to modify the 
microclimate. 

Drought conditions in the surrounding environment or within a plant, results in 
modulation of many genes and gene products. Examples of such drought responsive genes 
and gene products are shown in the Reference and Sequence Tables. These genes and/or 
products are responsible for effects on traits such as plant vigor and seed yield. They were 
discovered and characterized from a much larger set by experiments designed to find genes 
whose mRNA products changed in response to availability of water. 

While drought responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different drought responsive polynucleotides and/or gene products that 
have similar transcription profiles or similar biological activities, and members of the same or 
similar biochemical pathways. Whole pathways, or segments of pathways are controlled by 
transcription factor proteins and proteins controlling the activity of signal transduction 
pathways. Therefore, manipulation of the levels of such proteins is especially useful for 
altering phenotypes and biochemical activities of plants. In addition, the combination of a 
drought responsive polynucleotide and/or gene product with another environmentally 
responsive polynucleotide is also useful because of the interactions that exist between 
hormone regulated pathways, stress pathways, nutritional pathways and development. Here, 
in addition to polynucleotides having similar transcription profiles and/or biological 
activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in a common pathway. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: 108572, 108573, 108502, 108503, 108504, 
108556, 108482, 108483, 108473, 108474, 108477). For transcripts that had higher levels in 
the samples than the control, a "+" is shown. A is shown for when transcript levels were 
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reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Drought genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 



Drought Genes Identified By Cluster Analyses Of Differential Expression 
Drought Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of Drought genes is any group in the MA clust that comprises 
a cDNA ID that also appears in Expt ID 108572, 108573, 108502, 108503, 108504, 108556, 
108482, 108483, 108473, 108474, 108477 of the MA_diff table(s). 

Drought Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Drought genes. A group in the MA_clust is 
considered a Drought pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 



Drought Genes Identified By Amino Acid Sequence Similarity 
Drought genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Drought genes. 
Groups of Drought genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Drought 
pathway or network is a group of proteins that also exhibits Drought functions/utilities. 

Such drought responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to drought conditions or in the 
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absence of drought conditions. Further, promoters of drought responsive genes, as described 
in the Reference tables, for example, are useful to modulate transcription that is induced by 
drought or any of the following phenotypes or biological activities below. 

More specifically, drought responsive genes and gene products are useful to or 
modulate one or more of the following phenotypes: 

• Growth 

• Roots 
Stems 

. Buds 
Leaves 

• Development 
. Cell Growth 

• Leaves 

• Fruit Development 
Seed Development 
Senescence 
Stress Responses 

• Mediates response to desiccation, drought, salt and cold 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the drought responsive genes when the desired 
sequence is operably linked to a promoter of a drought responsive gene. 

To produce the desired phenotype(s) above, one or more of the drought 
response genes or gene products can be tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, 
Methods. Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. 
(1998) Plant Physiol 118: 743-50 and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Ruzin (1999, In: Plant 
Microtechnique and Microscopy, Oxford University Press, London) and Khanna- 
Chopra et al. (1999, BBRC 255:324-7). 
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Alternatively, the activities of one or more of the drought responsive genes 
can be modulated to change biochemical or metabolic activities and/or pathways such 
as those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


ASSAY 


Cell Growth and 
Differentiation 


Preservation of Leaf Sub-Cellular 
Structures Including 
Photosynthetic Apparatus 


Jagtapetal. (1998) JExptl 
Botany 49:1715-1721 




Preservation of Cell Membrane 
Structures 


Munne-Bosch and Alegre 
(2000) Planta 210: 925-31 




Regulation of Stomatal 
development and Physiology 


Menke et al. (2000) Plant 
Physiol. 122:677-686. 




Regulation of Factors Involved in 
the Drought-adapted change in 
cell ultrastructure 


Harraketal. (1999) Plant 
Physiol. 121:557-564. 


Physiology 


Modulation of Transpiration 


Allen etal. (1999) Plant 
Cell 11: 1785-98 
Li et al. (2000) Science 
287: 300-303 

Burnett et al. (2000) J Exptl 
Bot51: 197-205 
Raschke (1987) In: 
Stomatal function, Zeiger 
et al., Eds, 253-79 




Modulation of Photosynthesis 


Sung and Krieg (1979) 
Plant Physiol 64: 852-56 
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GENERAL CATEGORY 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


ASSAY 




Regulation of Epicuticular Wax 
Biosynthesis 


Rhee et al. (1998) Plant 
Physiol 116: 901-11 




Regulation of Carotenoid 
Biosynthesis 


Alegre (2000) Planta 210: 
925-31 

Loggini et al (2000) Plant 
Physiol 119:1091 


Stress Response 


Modulation of Leaf Rolling to 
minimize water loss 


Taiz and Zeiger (1991) In: 
Plant Physiology, 
Benj amin/Cummings 

T* 1 1 ' 1 ' _ /~1 _ Tl 1 ] 

Publishing Co., Redwood 
City, pp 346-70 




Modulation of Osmolite 
Synthesis 


Hareetal. (1998) Plant, 
Cell and Environment 21 : 
535-553 

TT a i f r\ r\r\f\\ T» 1 „ i 

Huan et al. (2000) Plant 
Physiol 122: 747-756 




Regulation of gene 
transcriptional activity specific to 
the establishment of drought 
tolerance 


Hareetal. (1999) J. Exptl. 
Botany 333:413-434. 




Regulation of protein degradation 
and reactivation during drought 
stress condition 


Lee and Vierling (2000) 
Plant Physiol. 122: 189-197 




Modulation/reconfiguration of 
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GENERAL CATEGORY 


BIOCHEMICAL OR 


ASSAY 






METABOLIC ACTIVITIES 








AND/OR PATHWAYS 








translation machineries 


Lis et al. (2000) Genes and 






("recycling" mechanisms) 


Development 14: 792-803 






adapTable to drought condition 






Signal Transduction 


Regulation of Ion Sequestration 


Bush and Jones (1987) Cell 








Calcium 8: 455-72 






Regulation of Nuclear Targeted 








Protein Transport 


Ferringno and Silver 








(1999) Methods in Cell 


w 






Biology 58: 107-22 






Regulation of Cytoplasmic Ca+2 




f~ 






Shi et al. (1999) Plant Cell 


f-x \ 

\ V\ 




Regulation of Kinase Synthesis 


11:2393-2406 






and Activity 










Li et al. (2000) Science 


t: : 




Modulation of Molecular 


287- 300-03 


t hr 
as: 




Chaperone Activity 










Mayhew et al (1996) 








Nature 379: 420-26 








Kimura et al. (1995) 








Science 268:1362-1365. 



Other biological activities that can be modulated by the drought responsive genes and 
gene products are listed in the Reference Tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 
5 Drought responsive genes are characteristically differentially transcribed in response 

to drought conditions, whether internal or external to an organism or cell. The MA_diff 
table(s) report(s) the changes in transcript levels of various drought responsive genes at 1 
and 6 hours after aerial tissues were isolated and left uncovered at room temperature on 3 
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MM paper, as compared to isolated aerial tissues placed on 3 MM paper wetted with 
Hoagland's solution. 

The data from this time course can be used to identify a number of types of drought 
responsive genes and gene products, including "early responders," and "delayed responders." 
Profiles of these different drought responsive genes are shown in the Table below together 
with examples of the kinds of associated biological activities. 



GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES OF GENE 








PRODUCIa 


Up regulated 


Early responders to 


Drought perception 


Transcription factors 


transcripts 


drought 


leading to the 


Transporters 


(level at 1 hr » 6 hr) 




establishment of 




(level at 1 hr > 6 hr) 




tolerance to drought 








Modulation of drought 


Change in cell membrane 






response transduction 


structure 






pathways 


Kinases and phosphatases 






Specific gene 


Transcription activators 






transcription initiation 


Change in chromatin 








structure and/or localized 








DNA topology 






Conditional shift in 


Modification of pre- 






preferential translation 


existing translation factors 






of transcripts 


by phosphorylation 








(kinases) or 








dephosphorylation 








(phosphatases) 








Synthesis of new 








translation factors 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES OF GENE 








PRODUCTS 






Changes in cell 


Stability of mediators of 






architecture to optimize 


protein-protein interaction 






cell adaptation to heat 








stress 








Changes in cell 


Synthesis and/or stability 






division cycle 


of factors regulating cell 








division 


Up regulated 


Maintenance of 


Maintenance of 


Transcription factors 


transcripts 


drought response 


response to drought and 


Specific factors (initiation 


(level at 1 hr < 6 hr) 




maintenance of 


and elongation) for protein 




"Delayed" responders 


drought-tolerance 


synthesis 






mechanisms 


RNA-binding proteins 








effective for mRNA 








stability 








Change in chromatin 








structure and/or DNA 








topology 








Stability of mediators of 






Maintenance of 


protein-protein interaction 






mechanisms effective 


Stability of factors to 






for ions sequestration, 


effectively utilize pre- 






osmolite biosynthesis, 


existing translation 






nuclear protein 


machinery ("recycling" 






transport, regulation of 


mechanisms) under 






cytoplasmic Ca+2, and 


drought condition 






regulation of proteins 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITIES OF GENE 








PRODUCTS 






effective for 








maintaining protein 








stability and 








conformation 








Maintenance of cellular 








structures 


Stability of mediators of 








protein-protein interaction 


Down-regulated 


Early responder 


Negative regulation of 


Transcription factors and 


transcripts 


repressors of "normal" 


drought response 


activators 


(level at 1 hr » 6 hr) 


state of metabolism 


inducible pathways 


Change in protein structure 


(level at 6 hr > 1 hr) 




released 


by phosphorylation 




Genes with 


Changes in 


(kinases) or 




discontinued 


biochemical and signal 


dephosphoryaltion 




expression or 


transduction pathways 


(phosphatases) 




unsTable mRNA in 


and processes operating 


Change in chromatin 




presence of water 


in cells 


structure and/or DNA 




stress 




topology 


Down-regulated 


Delayed repressors of 


Maintenance of 


Transcription factors and 


transcripts 


"normal" state of 


drought response 


activators 


(level at 1 hr > 6 hr) 


metabolism 


Maintenance of 


Kinases and phosphatases 






pathways released from 


Stability of factors for 




Genes with 


repression 


protein translation 




discontinued 


Changes in pathways 






expression or 


and processes operating 






unsTable mRNA in 


in cells 






presence of water 








stress 







Page 429 of 772 




Attorney Ukt. 

Client Dkt. 80090.005 



Use of Promoters of Drought Responsive Genes 

Promoters of Drought responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Drought responsive 
genes where the desired sequence is operably linked to a promoter of a Drought responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.D. WOUNDING RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plants are continuously subjected to various forms of wounding from physical attacks 
including the damage created by pathogens and pests, wind, and contact with other objects. 
Therefore, survival and agricultural yields depend on constraining the damage created by the 
wounding process and inducing defense mechanisms against future damage. 

Plants have evolved complex systems to minimize and/or repair local damage and to 
minimize subsequent attacks by pathogens or pests or their effects. These involve stimulation 
of cell division and cell elongation to repair tissues, induction of programmed cell death to 
isolate the damage caused mechanically and by invading pests and pathogens, and induction 
of long-range signaling systems to induce protecting molecules, in case of future attack. The 
genetic and biochemical systems associated with responses to wounding are connected with 
those associated with other stresses such as pathogen attack and drought. 

Wounding results in the modulation of activities of specific genes and, in 
consequence, of the levels of key proteins and metabolites. These genes, called here 
wounding responsive genes, are important for minimizing the damage induced by wounding 
from pests, pathogens and other objects. Examples of such wounding responsive genes, gene 
components and products are shown in the Reference, Sequence, Protein Group, Protein 
Group Matrix, MA_diff, and MA_clust tables. They can be active in all parts of a plant and 
so where, when and to what extent they are active is crucial for agricultural performance and 
for the quality, visual and otherwise, of harvested products. They were discovered and 
characterized from a much larger set of genes by experiments designed to find genes whose 
products changed in response to wounding. 

Manipulation of one or more wounding responsive gene activities are useful to 
modulate the biological activities and/or phenotypes listed below. Wounding responsive 
genes and gene products can act alone or in combination with genes induced in other ways. 
Useful combinations include wounding responsive genes and/or gene products with similar 
transcription profiles, similar biological activities, or members of functionally related 
biochemical pathways. Whole pathways or segments of pathways are controlled by 
transcription factor proteins and proteins controlling the activity of signal transduction 
pathways. Therefore, manipulation of the levels of such proteins is especially useful for 
altering phenotypes and biochemical activities of plants. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: 108574, 108575, 108524, 108525, and 
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Wounding (relating to SMD 3714, SMD 3715) ). For transcripts that had higher levels in 
the samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Wounding genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a or "-" 
indication 

Wounding Genes Identified By Cluster Analyses Of Differential Expression 

Wounding Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Wounding genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108574, 108575, 108524, 108525, and 
Wounding (relating to SMD 3714, SMD 3715) of the MA_diff table(s). 

Wounding Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Wounding genes. A group in the 
MA_clust is considered a Wounding pathway or network if the group comprises a cDNA ID 
that also appears in Knock-in or Knock-out tables that causes one or more of the phenotypes 
described in section above. 

Wounding Genes Identified By Amino Acid Sequence Similarity 

Wounding genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Wounding 
genes. Groups of Wounding genes are identified in the Protein Group table. In this table, 
any protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
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Wounding pathway or network is a group of proteins that also exhibits Wounding 
functions/utilities. 

Such wounding responsive genes and gene products can function either to increase or 
dampen the phenotypes and activities below, either in response to wounding or in the absence 
of wounding. 

Further, promoters of wounding responsive genes, as described in the Reference tables, 
for example, are useful to modulate transcription that is induced by wounding or any of the 
following phenotypes or biological activities below. 



Wounding responsive genes and gene products can be used to alter or modulate one or 
more of the following phenotypes: 
Growth Rate 

- Whole Plant 

- Height 

- Width 
Flowering Time 

Organs 

- Coleoptile Elongation 
Young Leaves 

- Roots 

- Lateral Roots 

- Tuber Formation 

- Flowers 
Fruit 

- Seeds 
Biomass 

- Fresh And Dry Weight During Any Time In Plant Life, Such As At Maturation 

- Number Of Flowers 

- Number Of Seeds 

- Seed Yield 



V.D.I. USE OF WOUNDING-RESPONSIVE GENES TO MODULATE 



PHENOTYPES 
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- Number 



Size 



- Weight 



Harvest Index 



Content and Composition, e.g., Amino Acid, Nitrogen, Oil, Protein, 



And Carbohydrate 

- Fruit Yield 

- Number 
Size 

- Weight 

- Harvest Index 

- Post Harvest Quality 

- Content And Composition, e.g., Amino Acid, Carotenoid, Jasmonate, Protein, 
And Starch 

Seed and Fruit Development 

- Germination Of Dormant And Non-Dormant Seeds 

- Seed Viability 

Seed Reserve Mobilization 
Fruit Ripening 

- Initiation Of The Reproductive Cycle From A Vegetative State 

- Flower Development Time 

- Insect Attraction For Fertilization 

- Time To Fruit Maturity 
Senescence 

- Fruits, Fruit Drop 

- Leaves 

Stress And Disease Responses 

- Drought 

- Heat And Cold 

- Wounding By Any Source, Including Wind, Objects, Pests And Pathogens 
Uv And High Light Damage 



To regulate any of the phenotype(s) above, activities of one or more of the wounding responsive 



Insect, Fungus, Virus, Worm, Nematode Damage 
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genes or gene products can be modulated and the plants can be tested by screening for the 
desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed, for example, in accordance with 
5 Johnson et.al. (1998) Plant Physiol 1 16:643-649, Reymond et.al. (2000) Plant Cell 12 707-720, 
or Keith et.al. (1991) Proc. Nat. Acad. Sci.USA 888821 8825. 



s 

UJ 
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?- 
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V.D.2. USE OF WOUNDING-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the wounding responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities are documented and can be measured 
according to the citations included in the Table below: 



PROCESS 


BIOLOGICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Plant Tissue 
Proliferation 


Cell Damage Repair; Cell 
Division 


Flanders (1990) J. Cell Biol. 
110: 1111-1122 


Wound Induced 
Pathways Providing 
Defense Against 
Pests And Pathogens 


Synthesis Of Jasmonic And 
Salicylic Acids And The 
Pathways Induced By These 
Signaling Molecules. 
Induction Of Jasmonic Acid 
Independent Defense 
Pathways. 

Induction Of Lipoxygenase, 
Thionins And Nodulins 


Reymond, P and Farmer E.E. 
Current Opinion in Plant 
Biology 1998 1:404-411 
Creelman, RA and Mullet, 
J.E. (1997) Ann Rev. Plant 
Physiol Mol Biol 48: 355-387 
Leon et al. 1998 Mol Gen 
Genet 254: 412-419 
Titarentko et al. 1997 Plant 
Physiol 115: 817-826 




Cell Wall Degradation, 
Ethylene Formation, Systemic 
Signaling And Induction Of 
Defense Related Genes 


Rojo, E. et al. 1998. Plant J 
13:153-165 

Ryan, CA and Pearce, G. 
1998. Ann Rev. Cell Dev. 
Biol 14: 1-17 




Specific Rnase Induction 


Reymond, P. et al. 2000 . 
Plant Cell 12:707-720 
Glazebrook, J. 1999. Current 
Opinion in Plant Biol. 2: 280- 
286 

O'Donnel P. J., etal. 1996 
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PROCESS 


BIOLOGICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 








Science 274: 1914-1917 






Rojoet al. 1999. Plant J. 20: 






135-142 






Merkouropoulus G. et al. 






1999 Planta 208:212-219 






Kariu et al. 1998. Bioscience 






Biotechnology and 






Biochemistry 62: 1144-1151 






Mcoann et al. 1997 PNAS 94: 






5473.5477 


Other Stress Induced 


Abscisic Acid Formation And 


Carrera, E and Prat, S. 1998. 


Pathways 


Its Signaling Pathway 


Plant J 15: 767-771 




Cold Responsive Genes and 


Chao et. al. 1999. Plant 




Pathways 


Physiol 120: 979-992 




Drought Induced Dehydrins 






And Pathways 




Modified Lipid 


Membrane Lipid Synthesis 


Martin, M et al. 1999 Europe 


Motabolism 


Including Omega-3 Fatty 


J. Biochem 262: 283-290 




Acid Desaturase 






Lipases 






Lipid Transfer Proteins 






Induction Of Glycohydrolases 




Modified Sugar And 


And Glycotransferases, 




Energy Metabolism 


Amylases 






Induction Of 






Aminotransferases,Arginase,P 




Modified Protein And 


roteases And Vegetative 




Nitrogen Metabolism 


Storage Proteins, Aromatic 






Amino Acid Synthesis 
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PROCESS 


BIOLOGICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 








Secondary Metabolite 
Induction 


Aromatic Amino Acid 
Synthesis And Secondary 
Metabolites 


Keith, B et al. 1991 PNAS 88: 
8821-8825 



Other biological activities that can be modulated by wound responsive genes and their 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

The MA_diff table reports the changes in transcript levels of various wound 
responsive genes in the aerial parts of a plant, 1 and 6 hours after the plants were wounded 
with forceps. The comparison was made with aerial tissues from unwounded plants. 

The data from this time course reveal a number of types of wound responsive genes 
and gene products, including "early responders," and "delayed responders." Profiles of the 
individual wounding responsive genes are shown in the Table below together with examples 
of the kinds of associated biological activities that are modulated when the activities of one or 
more such genes vary in plants. 
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TRANSCRIPT 


TYPES OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up Regulated 


Early Responders 


• Induction Of Key 


• Transcription Factors 


Transcripts 


To Wounding 


Signaling Pathways 


• Kinases And 


(Level At lh~6h) 




Within And Between 


Phosphatases 


Or 




Cells 




(Level At lh>6h) 






Jasmonic 






• Modulation Of 


Acid,Salicylic Acid 






Wounding And Stress 


And Nitric Oxide 






Induced Signal 


Pathway Proteins. 






Transduction Pathways 










Glycohydrolases 






• Specific Gene 


Dehydrins 






Transcription Initiation 


Rnases 






Induction Of Repair 


• Metabolic Enzymes 






Processes Or Cell Death 


• Nodulins 








• Cell Division And 








Cell Wall Proteins 








• Cold Response 






• Reorientation Of 


Proteins 






Metabolism, Including 


Lipoxygenase 






Management Of Active 


Jacalin 






Oxygen 


Proteins To Detoxify 








Active Oxygen 








Species 








• Systemin 






• Movement Of Wound 








Induced Signals Through 








Plant 
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TRANSCRIPT 


TYPES OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 
ACTIVITY 








• Biosynthetic 






• Synthesis Of 


Enzymes 






Phytoalexins And 








Secondary Metabolites 




Up Related 


Delayed 


• Maintenance Of 


• Transcription Factors 


Transcripts 


Responders 


Defence Pathways 


• Kinases And 


( Level At lh<6h) 






Phosphatases 




Genes Involved In 


• Maintenance Of 


• Jasmonic 




Wounding 


Reorientated Metabolism 


Acid,Salicylic Acid 




Response At 




And Nitric Oxide 




Distant Sites From 




Pathway Proteins 




Wound. 


• Maintenance Of Wound 
Response 


Glycohydrolases 
Dehydrins 




Genes Involved In 


• Programmed Cell Death 


Rnases 




Maintenance Of 


In Selected Cells 


• Metabolic Enzymes 




Wounding 


Reorientation Of 


• Nodulins 




Response 


Metabolism 


• Cold Response 
Proteins 
Lipoxygenase 
Jacalin 

Proteins To Detoxify 
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TRANSCRIPT 


TYPES OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 








Active Oxygen 








Species 








Cell Division And 








Cell Wall Proteins 






Movement Of Wound 


Systemin 






Induced Signals Through 








Plant 








• Synthesis Of 


• Biosynthetic 






Phytoalexins And 


Enzymes 






Secondary Metabolites 




Down - Regulated 


Early 


Negative Regulation Of 


• Transcription Factors 


Transcripts 


Responder 


Wounding Response 


Change In Protein 


(Level At lh~6h) 


Repressors Of 


Pathways Released 


Structure By 


Or 


Wounding 




Phosphory-Laton 


(Level At6Hr>lh) 


Response 




(Kinases) Or 




State 




Dephos-Phorylation 








(Phosphatases) 








• Change In Chromatin 








Structure And Or 








Dna Topology 






• Changes In Pathways 


• Local Changes In 




Genes With 


And Processes Operating 


Regulatory Proteins, 




Discontinued 


In Cells 


Metabolic Enzymes, 




Expression Or 




Transporters Etc. 




UnsTable 








mRNA 








Following 








Wounding 






Down - Regulated 


Delayed 


• Negative Regulation Of 


Transcription 
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TRANSCRIPT 


TYPES OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Transcripts 


Repressors Of 


Wounding Response 


Factors, 


(Level At lhr>6h) 


Wounding 


Pathways Released 


Phosphatases, 




Response State 




Kinases 








• Changes In Protein 








Complex Structures 








• Chromatin 








Restructuring 








Proteins 




Genes With 


Change In Pathways And 


Local Changes In 




Discontinued 


Process Operating In 


Regulatory Proteins, 




Expression Or 


Cells 


Metabolic Enzymes, 




UnsTablemRNA ' 




Transporters Etc. 




Following 








Wounding 


Programmed Cell Death 


• Most Proteins In 








Selected Cells 








Undergoing Death 



Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the wounding responsive genes when the desired sequence is 
operably linked to a promoter of a wounding responsive gene. 
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V.E. METHYL JASMONATE (JASMONATE) RESPONSIVE GENES, GENE 

COMPONENTS AND PRODUCTS 
Jasmonic acid and its derivatives, collectively referred to as jasmonates, are naturally 
occurring derivatives of plant lipids. These substances are synthesized from linolenic acid in 
5 a lipoxygenase-dependent biosynthetic pathway. Jasmonates are signalling molecules which 
have been shown to be growth regulators as well as regulators of defense and stress 
responses. As such, jasmonates represent a separate class of plant hormones. 

Changes in external or internal jasmonate concentration result in modulation of the 
activities of many genes and gene products. Examples of such "jasmonate responsive" genes 
10 and gene products are shown in the Reference and Sequence Tables. These genes and/or 
products are responsible for effects on traits such as plant vigor and seed yield, especially 
when plants are growing in the presence of biotic or abiotic stresses. They were discovered 
and characterized from a much larger set of genes by experiments designed to find genes 
O whose mRNA products changed in concentration in response to application of methyl 
J5 jasmonate to plants. 

Jj Manipulation of one or more jasmonate responsive gene activities are useful to 

RJ modulate the biological activities and/or phenotypes tested below. Jasmonate response genes 
m and gene products can act alone or in combination. Useful combinations include jasmonate 
L responsive genes and/or gene products with similar transcription profiles, similar biological 
ifo activities, or members of the same co-regulated or functionally related biochemical pathways. 

Ft \ 
s " : 

"p Whole pathways or segments of pathways are controlled by transcription factor proteins and 

fas. 

j-f proteins controlling the activity of signal transduction pathways. Therefore, manipulation of 
such protein levels is especially useful for altering phenotypes and biochemical activities 
Such jasmonate responsive genes and gene products can function to either increase or 

25 dampen the phenotypes or activities below either in response to changes in jasmonate 

concentration or in the absence of jasmonate fluctuations. The MA_diff Table(s) reports the 
transcript levels of the experiment (see EXPT ID: 108568, 108569, 108555). For transcripts 
that had higher levels in the samples than the control, a "+" is shown. A "-" is shown for 
when transcript levels were reduced in root tips as compared to the control. For more 

30 experimental detail see the Example section below. 

MeJA genes are those sequences that showed differential expression as compared to controls, 
namely those sequences identified in the MA_diff tables with a "+" or "-" indication 
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MeJA Genes Identified By Cluster Analyses Of Differential Expression 

MeJA Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of MeJA genes is any group in the MA clust that comprises a 
cDNA ID that also appears in Expt ID 108568, 108569, 108555 of the MA_diff table(s). 

MeJA Genes Identified By Correlation To Genes That Cause Physiological 
Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of MeJA genes. A group in the MA_clust is 
considered a MeJA pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

MeJA Genes Identified By Amino Acid Sequence Similarity 

MeJA genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis MeJA genes. Groups of 
MeJA genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a MeJA pathway or 
network is a group of proteins that also exhibits MeJA functions/utilities. 

Further, promoters of jasmonate responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by jasmonate or any 
of the following phenotypes or biological activities below. 
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5 



10 



1 1 



25 



30 



V.E.I. USE OF JASMONATE RESPONSIVE GENES TO MODULATE 
PHENOTYPES: 

Jasmonate responsive genes and their gene products can be used to alter or modulate 
more of the following phenotypes: 
Growth rate 

Whole Plant, including Height, Flowering Time, etc. 

Seedling 

Organ 

Coleoptile Elongation 
Young Leaves 
Roots 

Lateral Roots 

Tuber Formation 

Flowers 

Fruit 

Seeds 

Biomass 

Fresh and Dry Weight during any time in plant life, including maturation and 
senescence 
Number of Flowers 
Number of Seeds 

Secondary Metabolite Accumulation 
Alkaloids 
Anthocyanins 

Paclitaxel and Related Taxanes 

Rosmarinic 

Seed Yield 

Number, Size, Weight, Harvest Index 

Content and Composition, e.g., Amino Acid, Jasmonate, Oil, Protein, 
and Starch 
Fruit Yield 

Number, Size, Weight, Harvest Index, Post Harvest Quality 
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Content and Composition e.g., Amino Acid, Carotenoid, Jasmonate, 
Protein, Starch 
Seed and Fruit Development 
Germination of Dormant and Non-Dormant Seeds 
Seed Viability 
Seed Reserve Mobilization 
Fruit Ripening 

Initiation of the Reproductive cycle from a vegetative state 
Flower Development Time 
Insect Attraction for Fertilization 
Time to Fruit Maturity 
Senescence 
Fruits, Fruit Drop 
Leaves 

Stress and Disease Responses 

Drought 

Wounding 

UV damage 

Insect, Fungus, Virus, Worm damage 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the jasmonate responsive genes when the desired 
sequence is operably linked to a promoter of a jasmonate responsive gene. 

To improve any of the phenotype(s) above, activities of one or more of the 
jasmonate responsive genes or gene products can be modulated and the plants can be 
tested by screening for the desired trait. Specifically, the gene, mRNA levels, or 
protein levels can be altered in a plant utilizing the procedures described herein and 
the phenotypes can be assayed, for example, in accordance to citations described 
below. 

V.E.2. USE OF JASMONATE-RESPONSIVE GENES TO MODULATE 

BIOCHEMICAL ACTIVITIES: 
The activities of one or more of the jasmonate responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 



Page 446 of 772 




Attorney Diet. 27^U-14S3F 
Client Dkt. 80090.005 



those noted below. Such biological activities are documented and can be measured 
according to the citations included in the Table below: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Turnover of proteins 


- Induction of various 
proteases, ubiquitin and 
proteosome components 
and turnover of RNA 
polymerases and 
translation initiation factors 
t Reduction in many 
ribosomal proteins 


This study. Standard 
biochemical assays. 


Activation of nitrogen 
metabolism 


Induction of glut amine 
synthetase, many 
aminotransferases, 
vegetative storage proteins 


Crawford (1995) Plant Cell 
7, 859-868 
This study. Standard 
biochemical assays. 


Lipid turnover 


Induction of various 
lipases, desaturases, and 
reduction of lipid transfer 
protein mRNAs 


This study. Standard 
biochemical assays. 


Sugar metabolism 


Induction of sugar 
transporters, UDP 
glucosyltransferases, other 
transferases 


This study. Standard 
biochemical assays. 
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Glycolysis and central 
carbon metabolism 


Induction of glycolytic 
related enzymes . Example, 
glucose 6-phosphate 
dehydrogenase, 
glyceraldehyde-3- 
phosphate dehydrogenase, 
phosphoglycerate kinase, 
phosphoglucomutase ATP 
synthase 


This study. Standard 
biochemical assays. 


Chlorosis 


Degradation of 
Chlorophyll 


Tsuchiya et al. (1999) Proc. 
Natl. Acad. Sci. U SA 
96:15362-15367 




Inhibition of 
Photosynthesis Related 
Proteins 


Reinbothe et al. (1993) J. 
Biol. Chem. 268, 10606- 
10611 


Carbon Assimilation and 
turnover 


Induction of chlorophyll ab 
binding protein precursor 


Reinbothe et al (1993) J. 
Biol Chem. 268, 10606- 
10611 


Jasmonate metabolism 


Induction of lipid 
biosynthesis, myrosinase 
and jacalin 


This study. Standard 
biochemical assays. 


Jasmonate mediated signal 
transduction 


Receptor binding 


Cho and Pai (2000) Mol 
Cells 10, 317-324 




Protein kinases 


Lee et al. (1998) Mol. Gen. 
Genet. 259,516-522 

Seo et al. (1999) Plant Cell 
11,289-298 

Yoon et al. (1999) Plant Mol. 
Biol. 39, 991-1001 
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Ubiquitination of 
Repressor Proteins 


Xie et al. (1998) Science 280, 
1091-1094 




Calcium Flux regulators 


Bergey and Ryan (1999) 
Plant Mol. Biol. 40,815-823 




Transcription Activators. 
Example- induction of 
various zinc finger, myb 
and AP-2 related factors 


Xiang et al. (1996) Plant 
Mol. Biol. 32,415-426 

Menke et al. (1999) EMBO J. 
18,4455-4463 


Response to Cell 
Membrane Damage 


Lipid Peroxidation 


Dubery et al. (2000) Mol. 
Cell Biol. Res. Commun. 3, 
105-110 


Cell Elongation 


Inhibition of incorporation 
of Glucose into Cell Wall 
Saccharides 


Burnett etal. (1993) Plant 
Physiol. 103,41-48 


Cell Organization and 
Division 


- Reductions in 
tropomyosin related 
proteins and certain cyclins 

- Induction of actins and 
tubulins 


Ishikawa et al. (1994) Plant 
Mol. Biol. 26, 403-414 


Cell Wall Turnover and 
modulation 


- Induction of cell wall 
proteins, glycine-rich 
proteins, annexins, pectate 
lyase and pectin esterases 

- Reductions in various 
dehydrins and expansins 


Creelman et al. (1992) Proc. 
Natl. Acad. Sci. USA 89, 
4938-4941 

Garcia-Muniz et al. (1998) 
Plant Mol. Biol. 38, 623-632 

Norman et al (1999) Mol. 
Plant Microbe Interact. 12, 
640-644 
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Stress, Disease, and 


Induction of antifungal 


Hildmann et al. (1992) Plant 


Pathogen Resistance 


proteins, wounding 


Cell 4, 1157-1170 




responsive proteins, 






dehydrins, heat shock type 


Reinbothe et al. (1994) Proc. 




proteins and elicitor 


XT J.1 A 1 O ' T TO A C\ 1 

Natl. Acad. Sci. USA 91, 




response proteins 


7012-7016 






Moons et al. (1997) Plant 






Cell 9, 2243-2259 






Richard et al. (2000) Plant 






Mol. Biol. 43, 1-10 






Van Wees et al. (2000) Proc. 






Natl. Acad. Sci. USA 97, 






8711-8716 




Phytoalexin Biosynthesis 


Creelman et al. (1992) Proc. 






Natl. Acad. Sci. USA 89, 






4938-4941 






Choi et al. (1994) Proc. Natl. 






Acad. Sci. USA 91, 2329- 






2333 




Biosynthesis of phenolics 


Doares et al., (1995) Proc. 






Natl. Acad. Sci. USA 92, 






4095-5098 




Production of Protease 


Botellaetal. (1996) Plant 




Inhibitors 


Physiol 112, 1201-1210 
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Defense Gene 
Transcription in Response 
toUV 


Mason et al. (1993) Plant 
Cell 5, 241-251 

Schaller et al. (2000) Planta 
210, 979-984 


Secondary Metabolite 
biosynthesis 


Fruit Cartenoid 
Composition 


Czapski and Saniewski 
(1992) J. Plant Physol. 139, 
265-268 




Palitaxel and Related 
Taxanes 


Yukimune et al. (1996) 
Nature Biotech. 14, 1129- 
1132 




Alkaloids 


Aerts et al. (1994) Plant J. 4, 
635-643 

Geerlings et al. (2000) J. 
Biol. Chem. 275, 3051-3056 




Anthocyanins 


Franceschi et al. (1991) Proc. 
Natl. Acad. Sci. USA 83, 
6745-6749 




Rosmarinic 


Mizukami et al., (1993) Plant 
CellReprod. 12, 706-709 




Activation of Ethylene- 
forming Enzyme and 
Production of Ethylene 


Czapski and Saniewski 
(1992) J. Plant Physiol. 139, 
265-268 



Other biological activities that can be modulated by the jasmonate responsive genes 
and their products are listed in the Reference Tables. Assays for detecting such biological 
activities are described in the Domain section of the Reference Tables. 

Jasmonate responsive genes are characteristically differentially transcribed in 
response to fluctuating jasmonate levels or concentrations, whether internal or external to an 
organism or cell. The MA_diff table(s) report(s) the changes in transcript levels of various 
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jasmonate responsive genes in the aerial parts of a seedling at 1 and 6 hours after being 
sprayed with Silwet L-77 solution enriched with methyl jasmonate as compared to seedlings 
sprayed with Silwet L-77 alone. 

The data from this time course reveal a number of types of jasmonate responsive 
genes and gene products, including "early responders" and "delayed responders". Profiles of 
the individual kinds of jasmonate responsive genes are shown in the Table below, together 
with examples of the kinds of associated biological activities that are modulated when the 
activities of such genes vary. 



GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITY 


Upregulated 


Early Responders to 


Binding and 


Transcription Factors 


genes 


Jasmonate 


Perception of 


Transporters 


(Level at lhour 




Jasmonate 




= 6 hours). 






Kinases, Phosphatases, 


(Level atl hour 




Transduction of 


Leucine-rich Repeat 


> 6 hours) 




Jasmonate signal 


Proteins (LRRs), GTP- 






tranduction response 


binding proteins (G- 






pathways 


proteins), calcium- 








binding proteins and 








calcium responsive 








proteins 






Initiation of Specific 








Gene Transcription to 


Proteases,lipases, 






reorientate 


glutamine synthetase 






metabolism 


(GS), arginase, 








aminotransferases, 








glycosyltransferases, 








sugar transporters, cell 








wall proteins, methyl 








transferases, glycolytic 








enzymes. 


Upregulated 


Delayed Jasmonate 


Maintenance of 


Enzymes of methyl 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITY 


genes 


Responders 


Metabolism under 


j asmonate-induced 


(Level at lhour 




high Jasmonate 


pathways, including 


< 6 hours) 






dehydrin, phytoalexin, 








phenolic, carotenoid, 








alkaloid and 








anthocyanin 






Jasmonate signal 


biosynthesis. 






Tranduction Response 








Pathways 


Transcription factors, 








Transporters, Kinases 






Gene Transcription to 


and phosphatases 






Reorientate 








Metabolism 


Proteases, Lipases, 








Glutaminae 








Synthetase, Arginase, 








Aminotransferases, 








Lipid Peroxidases, 








Glycosyltransferases, 








Sugar transporters, 








Cell Wall Proteins, 






Gene Transcription to 


Glycolytic Enzymes, 






Maintain Reorientated 


Chlorophyll Binding 






Metabolism 


Proteins 








Transcription factors, 








kinases, phosphatases, 








LRRs, G-proteins 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITY 






Reorient Cell 


Actins, Tubulins, 






Division and Cell 


Myosins Cyclins, 






Development 


Cyclin-dependent 








Kinases (CDPKs) 








Glycosyl Transferases, 








Glycosyl hydrolases, 








Expansins, Extensins, 








O- Methyl 








Transferases 








Arabinogalactan- 








proteins (AGPs), 








Enzymes of Lipid 








Biosynthesis, Cutinase 


Down regulated 


Early responders of 


Relese of Suppression 


Transcription Factors, 


transcripts 


Jasmonate 


of Jasmonate Induced 


Kinases, Phosphatases, 


(level at 1 hour = 




Pathways 


LRRs, G-Proteins, 


6 hours) 






Chromatin 


( level at 6 hours 






Restructuring proteins, 


>1 hour) 


Genes with 


Reorientation of 






discontinued 


metabolism 


Ribosomal proteins, 




expression or 




Translation Factors, 




unsTable mRNA 




Histones, RNA 




following Jasmonate 




polymerases, Pectin 




uptake 




esterase, Lipid transfer 








proteins 
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GENE 


FUNCTIONAL 


TYPE OF 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


BIOLOGICAL 


BIOCHEMICAL 


LEVELS 


GENE 


ACTIVITY 


ACTIVITY 


Down regulated 


Genes with 


Negative Regulation 


Transcription factors 

Mr 


transcripts 


Discontinued 


of Jasmonate Induced 


Kinases, Phosphatases 

5 ST 


(level at 1 hour > 


expression or 


Pathways Released. 


Chromatin 


6 hours) 


UnsTable mRNA 




Restructuring Proteins, 




Following Jasmonate 




LRRs, G-proteins 




uptake 










Reorientation of 








metabolism 


Ribosomal proteins, 








Translation Factors, 








Histones 








RNA Polymerases, 








Cyclins 








Pectin esterase, Lipid 








Transfer Proteins 



Use of Promoters of Jasmonate Responsive Genes 

Promoters of Jasmonate responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Jasmonate responsive 
genes where the desired sequence is operably linked to a promoter of a Jasmonate responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter . 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.F. REACTIVE OXYGEN RESPONSIVE GENES, GENE COMPONENTS 
AND H2Q2 PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress conditions, 
including pathogen attack, wounding, extreme temperatures, and various other factors. To 
combat such conditions, plant cells deploy a battery of inducible defense responses, including 
triggering an oxidative burst. The burst of reactive oxygen intermediates occurs in time, place 
and strength to suggest it plays a key role in either pathogen elimination and/or subsequent 
signaling of downstream defense functions. For example, H2O2 can play a key role in the 
pathogen resistance response, including initiating the hypersensitive response (HR). HR is 
correlated with the onset of systemic acquired resistance (SAR) to secondary infection in distal 
tissues and organs. 

Changes in reactive oxygen, such as H2O2 or 0 2 " , in the surrounding environment or in 
contact with a plant results in modulation of the activities of many genes and hence levels of 
gene products. Examples of such reactive oxygen responsive genes and gene products are 
shown in the Reference, Sequence, Protein Group, Protein Group Matrix, MA_diff and 
MA_clust tables. These genes and/or products are responsible for effects on traits such as plant 
vigor and seed yield. The genes were discovered and characterized from a much larger set by 
experiments designed to find genes whose mRNA products changed in response to application 
of reactive oxygen, such as H2O2, to plants. 

Manipulation of one or more reactive oxygen responsive gene activities are useful to 
modulate the following biological activities and/or phenotypes listed below. Reactive oxygen 
responsive genes and gene products can act alone or in combination. Useful combinations 
include reactive oxygen responsive genes and/or gene products with similar transcription 
profiles, similar biological activities, or members of the same or functionally related 
biochemical pathways. Whole pathways or segments of pathways are controlled by 
transcription factor proteins and proteins controlling the activity of signal transduction 
pathways. Therefore, manipulation of such protein levels is especially useful for altering 
phenotypes and biochemical activities of plants. 

Such reactive oxygen responsive genes and gene products can function to either 
increase or dampen the above phenotypes or activities either in response to changes in 
reactive oxygen concentration or in the absence of reactive oxygen fluctuations. The MA_diff 
Table(s) reports the transcript levels of the experiment (see EXPT ID: 108582, 108583, 
108537, 108538, 108558, and H202 (relating to SMD 7523)). For transcripts that had higher 



Page 457 of 772 



• Attorney UKX. 
Client Dkt. 80090.005 

levels in the samples than the control, a "+" is shown. A "-" is shown for when transcript 
levels were reduced in root tips as compared to the control. For more experimental detail see 
the Example section below. 

Reactive Oxygen genes are those sequences that showed differential expression as 
5 compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 

Reactive Oxygen Genes Identified By Cluster Analyses Of Differential 
Expression 

10 Reactive Oxygen Genes Identified By Correlation To Genes That Are 

Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
C the microarray experiments, but also have identified the genes that act in concert with them. 
J=5 The MA_clust table indicates groups of genes that have well correlated transcription profiles 
% and therefore participate in the same pathway or network. 

fij A pathway or network of Reactive Oxygen genes is any group in the MA_clust that 

gj comprises a cDNA ID that also appears in Expt ID 108582, 108583, 108537, 108538, 
% 108558, and H202 (relating to SMD 7523) of the MA_diff table(s). 
20 

5 £ Reactive Oxygen Genes Identified By Correlation To Genes That 

p Cause Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 

merged to identify pathways or networks of Reactive Oxygen genes. A group in the 
25 MA_clust is considered a Reactive Oxygen pathway or network if the group comprises a 

cDNA ID that also appears in Knock-in or Knock-out tables that causes one or more of the 

phenotypes described in section above. 

Reactive Oxygen Genes Identified By Amino Acid Sequence Similarity 
30 Reactive Oxygen genes from other plant species typically encode polypeptides that 

share amino acid similarity to the sequences encoded by corn and Arabidopsis Reactive 
Oxygen genes. Groups of Reactive Oxygen genes are identified in the Protein Group table. 
In this table, any protein group that comprises a peptide ID that corresponds to a cDNA ED 
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member of a Reactive Oxygen pathway or network is a group of proteins that also exhibits 
Reactive Oxygen functions/utilities. 

Further, promoters of reactive oxygen responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by reactive oxygen or 
any of the following phenotypes or biological activities below. 

V.F.I. USE OF REACTIVE OXYGEN RESPONSIVE GENES TO 
MODULATE PHENOTYPES 
Reactive oxygen responsive genes and gene products are useful to or modulate one or 
more of the following phenotypes: 

Pathogen Tolerance and/or Resistance 

- Avr/R locus sensitive 
Non-Host sensitive 

- HR 

- SAR, e.g., where the reactive oxygen responsive gene and products are modulated 
in conjuntion with any of the bacterial, fungal, virus, or other organism listed 
below 

- Bacteria, resistance e.g. to Erwinia stewartii, Pseudomonas syringae, 
Pseudomonas tabaci, Stuart's wilt, etc. 

- Fungal resistance including to downy mildews such as Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; rusts such as Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; 
other fungal diseases such as Cercospora zeae-maydis, Colletotrichum 
graminicola, Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, 
Phytophthora parasitica, Peronospora tabacina, Septoria, etc.; 

Virus or viroid resistance, e.g. to tobacco or cucumber mosaic virus, ringspot 
virus, necrosis virus, pelargonium leaf curl virus, red clover mottle virus, tomato 
bushy stunt virus, and like viruses; 

- Insect resistance, such as to aphids e.g. Myzus persicae; beetles, beetle larvae; etc. 

- nematodes, e.g. Meloidogyne incognita; lepidoptera, e.g. Heliothus spp. etc. 

- Resistance Specifically in Primary or Secondary Leaves 
Stress Tolerance 
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Winter Survival 

- Cold Tolerance 

- Heavy Metal Tolerance, such as Cadmium 

- Physical Wounding; 

5 - Increased Organelle Tolerance to Redox Stress, such as in Mitochondria, and 

chloroplasts 

- Cell Death 

- Apoptosis, including death of diseased tissue; 
Senescence; 

10 - Fruit Drop; 

- Biomass 

- Fresh and Dry Weight during any time in plant life, such as maturation; 

- Number of Flowers, Seeds, Branches, and/or Leaves; 

G - Seed Yield, including Number, Size, Weight, and/or Harvest Index 

J5 - Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

2? - Plant Development 

E 1 

W - Time to Fruit Maturity 

Ul 

rTi - Cell Wall Strengthening and Reinforcement 

Plant Product Quality 
Paper making quality 
~ - Food additives 

3 - Treatment of Indications modulated by Free Radicals 

Cancer 

To regulate any of the phenotype(s) above, activities of one or more of the reactive 
25 oxygen responsive genes or gene products can be modulated and the plants can be tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be screened 
for variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed , for example, in 
accordance to Alvarez et al, (1998) Cell 92: 773-784; Halhbrock and Scheel, (1989) Ann. Rev. 
30 Plant Physiol. Plant Mol. Biol. 40: 347-369; Lamb et al., (1997) Ann. Rev. Plant Mol. Biol. 
Plant Physio. 48: 251-275; Lapwood et al. (1984) Plant Pathol. 33: 13-20; Levine et al. (1996) 
Curr. Biol. 6: 427-437; McKersie et al., (2000) Plant Physiol. 122(4): 1427-1437; Olson and 
Varner (1993) Plant J. 4: 887-892; Pastore et al., (2000), FEBS Lett 470(1): 88-92; Pastori et al., 
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(1997) Plant Physiol. 113: 411-418.Romero-Puertas et al., (1999) Free Radic. Res. 1999 31 
Suppl: S25-31; Shirataki et al., Anticancer Res 20(1 A): 423-426 (2000); Wu et al., (1995) Plant 
Cell 7: 1357-1368; 
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V.F.2. USE OF REACTIVE OXYGEN RESPONSIVE GENES TO 
MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the reactive oxygen responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities are documented and can be measured 
according to the citations above and included in the Table below: 



PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 




Reinforcement of 


Modulation Of The Production Of 


Bradley et al. 1992. Cell 70, 


Cell Walls 


ExtracTable Pro line-Rich Protein 


21-30 




Modulation Of Lignification 


Mansouri et al. (1999) Physiol 






Plant 106: 355-362 


Stress, Disease, 


Induction Of Pathogenesis Related 


Chamnongpol et.al.(1998) 


Pathogen Resistance 


Proteins, Phytoalexins And Many 


Proc. Nat.Acad Sci USA 


and Wounding 


Defense Pathways. 


12;95:5818-23. 






Davis etal. (1993) 




Induction Of Detoxifying 


Phytochemistry 32: 607-611. 




Enzymes Such As Glutathione S- 


Chen etal. Plant J. (1996) 




Transferase And Ascorbate 


10:955-966 




Peroxidase 


Gadea et.al.(1999) Mol Gen 




Disease Resistance 


Genet 262:212-219 






Wu et.al.(1995) Plant Cell 7: 






1357-68 




Reactive Oxygen Generation 


Orozco-Cardenas and Ryan 




Following Wounding And 


(1999) Proc.Nat. Acad. Sci. 




Changes In Physical Pressure 


USA 25;96:6553-7. 






Yahraus et al. (1995) Plant 






Physiol. 109: 1259-1266 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Modulation Of Genes Involved In 
Wound Repair And Cell Division 


egendre et al. (1993) Plant 
hysiol. 102: 233-240 




Modulation Of Nitric Oxide 
Signaling 


Delledonne et al. (1998) 
Nature 394: 585-588 




Salicyclic Acid Accumulation And 
Signaling 


Durner and Klessig (1996) 
J.Biol.Chem. 271:28492-501 


Programmed Cell 
Death 


Induction Of Cell Death Pathway 
Genes 


Levine et al. (1996) Curr. Biol. 
6: 427-437. Reynolds 
et.al.(1998) Biochem.J. 
330:115-20 



Other biological activities that can be modulated by the reactive oxygen responsive 
genes and their products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 

Reactive oxygen responsive genes are characteristically differentially transcribed in 
response to fluctuating reactive oxygen levels or concentrations, whether internal or external 
to an organism or cell. The MA_diff table reports the changes in transcript levels of various 
reactive oxygen responsive genes in the aerial parts of a plant at 1 and 6 hours after the plant 
was sprayed with Silwett L-77 solution enriched with hydrogen peroxide as compared to 
plants sprayed with Silwett L-77 alone. 

The data from this time course reveal a number of types of reactive oxygen responsive 
genes and gene products, including "early responders," and "delayed responders". Profiles of 
individual reactive oxygen responsive genes are shown in the Table below together with 
examples of which associated biological activities are modulated when the activities of one or 
more such genes vary in plants. 



GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSequence 


BIOCHEMICAL 


LEVELS 


GENE 




ACTIVITY 








OF GENE PRODUCTS 
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Upregulated 


Early Responders 


- Perceiving 


-Transcription Factors 


transcripts 


To 


Reactive Oxygen 


-Kinases And Phosphatases 


(Higher at Ih 


Reactive Oxygen 




-Transporters 


Than 6h) 




- Reactive Oxygen 


- Glutathione S-Transferase 


(Level at 1 h = 




Response 


-Heat Shock Proteins 


6h) 




Transduction 


-Salicylic Acid Response 






Pathways 


Pathway Proteins 








-Jasmonic Acid Pathway 






- Initiating Specific 


Proteins 






Gene Transcription 


-Dehydrins 








-Peroxidases 








-Catalase 








-Proteases 








-Pathogen Response Proteins 








-Ca 2+ Channel Blockers 








-Phenylalanine Ammonia 








Lyase 


Upregulated 


Delayed Reactive 


Maintenance Of 


-Transcription Factors 


transcripts 


Oxygen 


Defence Pathways 


- Kinases And Phosphatases 


(Lower at lh 


Responders 


To Control Active 


- Reactive Oxygen 


Than 6h) 




Oxygen 


Scavenging Enzymes 








- Cell Wall And Cell 








Division/Growth Promoting 






Activation Of Cell 


Pathway Enzymes 






Death Pathways In 


- Pathogen Response 






Specific Cells- 


Proteins 








- Proteins Of Defence 








Pathways 








- Proteases, Cellulases, 








Nucleases And Other 








Degrading Enzymes. 








- Membrane Proteins 








-Mitochondrial And 
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Chloroplast Energy Related 
Proteins 


Downregulated 


Early Responder 


Negative 


-Transcription Factors 


transcripts 


Repressors Of 


Regulation Of 


- Kinases And Phosphatases 


Level at lh = 6h 


Reactive Oxygen 


Reactive Oxygen- 


- Chromatin Remodelling 


Level at 6h> lh. 


Response 


Inducible Pathways 


Proteins 




Pathways 


Released 






Genes Of 








Pathways That 


Reduction In 


- Metabolic Enzymes In 




Are Minimized In 


Activities Of 


Affected Cells 




Response To 


Pathways Not 


- Membrane Proteins And 




Reactive Oxygen 


Maintained Under 


Cell Wall Proteins 


Down Regulated 


Delayed 


High Reactive 


-Transcription Factors 


Transcripts 


Responder 


Oxygen 


- Kinases And Phosphatases 


(Level atlh>6h 


Repressors Of 


Negative 


-Chromatin Remodelling 




Reactive Oxygen 


Regulation Of 


Proteins 




Response 


Reactive Oxygen 






Pathways 


Inducible Pathways 


- Metabolic Enzymes In 






Released 


Affected Cells 




Genes Of 




- Membrane Proteins And 




Pathways That 


Reduction In 


Cell Wall Proteins 




Are Minimised In 


Activities Of 






Response To 


Pathways Not 


- Many Proteins In Cells 




Reactive Oxygen 


Maintained Under 


Undergoing Cell Death Or In 






Reactive Oxygen 


Damaged Cells 






Programmed Cell 








Death 





Further, promoters of reactive oxygen responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by reactive oxygen or 
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V.G. SALICYLIC ACID RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Plant defense responses can be divided into two groups: constitutive and induced. 
Salicylic acid (SA) is a signaling molecule necessary for activation of the plant induced 
defense system known as systemic acquired resistance or SAR. This response, which is 
triggered by prior exposure to avirulent pathogens, is long lasting and provides protection 
against a broad spectrum of pathogens. Another induced defense system is the hypersensitive 
response (HR). HR is far more rapid, occurs at the sites of pathogen (avirulent pathogens) 
entry and precedes SAR. SA is also the key signaling molecule for this defense pathway. 

Changes in S A concentration in the surrounding environment or within a plant results in 
modulation of many genes and gene products. Examples of such SA responsive genes and gene 
products are shown in the Reference, Sequence, Protein Group, Protein Group Matrix, MA_diff 
and MA_clust tables. These genes and/or products are responsible for effects on traits such as 
plant vigor and seed yield. They were discovered and characterized from a much larger set by 
experiments designed to find genes whose mRNA products changed in response to SA 
treatment. 

While SA responsive polynucleotides and gene products can act alone, combinations 
of these polynucleotides also affect growth and development. Useful combinations include 
different SA responsive polynucleotides and/or gene products that have similar transcription 
profiles or similar biological activities, and members of the same or similar biochemical 
pathways. In addition, the combination of SA responsive polynucleotides and/or gene 
product with another environmentally responsive polynucleotide is also useful because of the 
interactions that exist between hormone regulated pathways, stress and pathogen induced 
pathways, nutritional pathways and development. Here, in addition to polynucleotides 
having similar transcription profiles and/or biological activities, useful combinations include 
polynucleotides that may have different transcription profiles but which participate in 
common and overlapping pathways. 

Such SA responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in SA concentration 
or in the absence of SA fluctuations. The MA_diff Table(s) reports the transcript levels of the 
experiment (seeEXPTID: 108586, 108587, 108515, 108552, 108471, 108472, 108469, 
108470, 107953, 107960, 108443, 108440, 108441, 108475, 108476). For transcripts that 
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had higher levels in the samples than the control, a "+" is shown. A "-" is shown for when 
transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

SA genes are those sequences that showed differential expression as compared to controls, 
namely those sequences identified in the MA_diff tables with a or "-" indication. 

SA Genes Identified By Cluster Analyses Of Differential Expression 



As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of S A genes is any group in the MA_clust that comprises a 
cDNA ID that also appears in Expt ID 108586, 108587, 108515, 108552, 108471, 108472, 
108469, 108470, 107953, 107960, 108443, 108440, 108441, 108475, 108476 of theMA_diff 
table(s). 

SA Genes Identified By Correlation To Genes That Cause Physiological 
Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of SA genes. A group in the MA_clust is 
considered a SA pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

SA Genes Identified By Amino Acid Sequence Similarity 

S A genes from other plant species typically encode polypeptides that share amino 
acid similarity to the sequences encoded by corn and Arabidopsis SA genes. Groups of SA 
genes are identified in the Protein Group table. In this table, any protein group that comprises 
a peptide ID that corresponds to a cDNA ID member of a SA pathway or network is a group 
of proteins that also exhibits SA functions/utilities. 



SA Genes Identified By Correlation To Genes That Are Differentially 



Expressed 
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Further, promoters of SA responsive genes, as described in the Reference tables, for 
example, are useful to modulate transcription that is induced by S A or any of the following 
phenotypes or biological activities below. 

V.G.I. USE OF SALICYLIC ACID-RESPONSIVE GENES TO 
MODULATE PHENOTYPES 
SA responsive genes and gene products are useful to or modulate one or more of the 
following phenotypes: 

- Pathogen Tolerance and/or Resistance 
Avr/R locus Interactions 

- Non-Host Interactions 

- HR 

- SAR, e.g., SA Responsive Genes And/Or Products In Conjuction With Any Of 
The Organisms Listed Below 

Resistance To Bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, 
Pseudomonas tabaci, Stuart's wilt, etc. 

- Resistance To Fungi e.g. to Downy Mildews Such As Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; Rusts Such As Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; 
And To Other Fungal Diseases e.g. Cercospora zeae-maydis, Colletotrichum 
graminicola, Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, 
Phytophthora parasitica, Peronospora tabacina, Septoria, etc.; 

- Resistance To Viruses Or Viroids e.g., To Tobacco Or Cucumber Mosaic Virus, 
Ringspot Virus, Necrosis Virus, Pelargonium Leaf Curl Virus, Red Clover Mottle 
Virus, Tomato Bushy Stunt Virus, And Like Viruses; 

- Resistance To Insecs, Such As To Aphids e.g. Myzus persicae; to Beetles And 
Beetle Larvae; to lepidoptera larvae e.g. Heliothus etc. 

- Resistance to Nematodes, e.g. Meloidogyne incognita etc 

- Local Resistance In Primary (Infected) Or Secondary (Uninfected) Leaves 
Stress Tolerance 

- Winter Survival 
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Cold Tolerance 

- Salt Tolerance 

- Heavy Metal Tolerance, Such As Cadmium 

- Tolerance To Physical Wounding; 

- Increased Organelle Tolerance To Redox Stress, Such As In Mitochondria, And 
Chloroplasts 

- Cell Death 

- Programmed Cell Death, Including Death Of Diseased Tissue And During 
Senescence 

- Fruit Drop 

- Biomass 

- Fresh And Dry Weight During Any Time In Plant Life, Such As Maturation 

- Number Of Flowers, Seeds, Branches, And/Or Leaves 

- Seed Yield, Including Number, Size, Weight, And/Or Harvest Index 

- Fruit Yield, Including Number, Size, Weight, And/Or Harvest Index 

- Plant Development 

- Time To Fruit Maturity 

- Cell Wall Strengthening And Reinforcement 

- Plant Product Quality 

Paper Making Quality 

- Food Additives 

- Treatment Of Indications Modulated By Free Radicals 

- Cancer 

To regulate any of the desired phenotype(s) above, activities of one or more of the SA 
responsive genes or gene products can be modulated and the plants tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to Zhao et al. (1998, Plant Cell 10:359-70) and Alvarez et al. 
(1998, Cell 92:733-84). 
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V.G.2. USE OF SALICYLIC ACID-RESPONSIVE GENES TO 
MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the SA responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included 
in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATION INCLUDING 
ASSAYS 


Protection From Microbial 
Pathogens 


Systemic Acquired Resistance 
(SAR) 

- Phytoalexin Biosynthesis 

- PR Protein Biosynthesis 
Local Resistance 
Wound Response 


Alvarez et al. (1998) Cell 
92: 733-84 

Lapwood et al. (1984) Plant 
Pathol. 33: 13-20 
Davis et al. (1993) 
Phytochemistry 32: 607-1 1 
Yahraus et al. (1995) Plant 
Physiol. 109: 1259-66 


Cell Signaling 


- Modulation Of Reactive 
Oxygen Signaling 

- Modulation Of No Signaling 


-Alvarez et al. (1998) Cell 
92: 773-784 . 
Delledonne etal. (1998) 
Nature 394: 585-588 


Growth And Development 


- Lignification 


Redman et al. (1999) Plant 
Physiol. 119: 795-804 



Other biological activities that can be modulated by the S A responsive genes and gene 
products are listed in the Reference tables. Assays for detecting such biological activities are 
described in the Protein Domain table. 

Salicylic acid responsive genes are characteristically differentially transcribed in 
response to fluctuating SA levels or concentrations, whether internal or external to an 
organism or cell. The MA_diff table reports the changes in transcript levels of various SA 
responsive genes in entire seedlings at 1 and 6 hours after the seedling was sprayed with a 
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Hoagland's solution enriched with SA as compared to seedlings sprayed with Hoagland's 
solution only. 

The data from this time course can be used to identify a number of types of SA 
responsive genes and gene products, including "early responders" and "delayed responders." 
Profiles of these different SA responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 



GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


OF GENE 




ACTIVITIES OF GENE 
PRODUCTS 


Upregulated Genes 


Early 


- SA Perception 


-Transcription Factors 


(Level At lh=6h) 


Responders To 


- SA Uptake 


-Transporters, Kinases, 


Or 


SA 


-Modulation Of SA 


Phosphatases, G- 


(Level At lh > 6h) 




Response Transduction 
Pathways 


Proteins, LRR, DNA 
Remodelling Proteins 


Upregulated Genes 
(Level At lh < 6h) 


Delayed 
Responders To 
SA 


- Specific Defensegene 
Transcription Initiation 
(E.G. Pr Genes, Pal 


-Proteases, PRProteins, 
Cellulases, Chitinases, 
Cutinases, Other 
Degrading Enzymes, Pal, 
Proteins Of Defense 
Pathways, Cell Wall 
Proteins 

Epoxide Hydrolases, 
Methyl Transferases 


Downregulated 


- Early 


- Negative Regulation 


Transcription factors, 


(Level At lh = 6h) 


Responder 


OfSA Inducible 


kinases, phosphatases, G- 


Or 


Repressors To 


Pathways Released 


proteins, LRR, 


(Level At 6h>lh) 


SA 

- Genes With 




transporters, calcium 
binding proteins, 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


OF GENE 




ACTIVITIES OF GENE 








PRODUCTS 




Discontinued 




chromatin remodelling 




Expression Or 




protein 




UnsTable 








mRNA In The 








Presence Of S A 






Down-Regulated 


- Delayed 


Negative Regulation Of 


Transcription Factors, 


Transcripts 


Responders To 


SA Inducible Pathways 


Kinases, Phosphatases, 


(Level At lh > 6h) 


SA Metabolism 


Released 


G-Proteins, LRR, 




- Genes With 




Transporters, Calcium 




Discontinued 




Binding Proteins, 




Expression Or 




Chromatin Remodelling 




UnsTable 




Protein 




mRNA In The 








Presence Of S A 







Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the SA responsive genes when the desired sequence is 
operably linked to a promoter of a S A responsive gene. 
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V.H. NITRIC OXIDE RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

The rate-limiting element in plant growth and yield is often its ability to tolerate 
suboptimal or stress conditions, including pathogen attack conditions, wounding and the 
presence of various other factors. To combat such conditions, plant cells deploy a battery of 
inducible defense responses, including synergistic interactions between nitric oxide (NO), 
reactive oxygen intermediates (ROS), and salicylic acid (SA). NO has been shown to play a 
critical role in the activation of innate immune and inflammatory responses in animals. At 
least part of this mammalian signaling pathway is present in plants, where NO is known to 
potentiate the hypersensitive response (HR). In addition, NO is a stimulator molecule in plant 
photomorphogenesis. 

Changes in nitric oxide concentration in the internal or surrounding environment, or 
in contact with a plant, results in modulation of many genes and gene products. Examples of 
such nitric oxide responsive genes and gene products are shown in the Reference and 
Sequence Tables. These genes and/or products are responsible for effects on traits such as 
plant vigor and seed yield. They were discovered and characterized from a much larger set 
by experiments designed to find genes whose mRNA products changed in response to nitric 
oxide treatment. 

While nitric oxide responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different nitric oxide responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the 
same or similar biochemical pathways. Whole pathways or segments of pathways are 
controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of the levels of such proteins is especially 
useful for altering phenotypes and biochemical activities of plants. In addition, the 
combination of a nitric oxide responsive polynucleotide and/or gene product with other 
environmentally responsive polynucleotides is also useful because of the interactions that 
exist between hormone regulated pathways, stress pathways, pathogen stimulated pathways, 
nutritional pathways and development. Here, in addition to polynucleotides having similar 
transcription profiles and/or biological activities, useful combinations include 
polynucleotides that may have different transcription profiles but which participate in 
common or overlapping pathways. The MA_diff Table(s) reports the transcript levels of the 
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experiment (see EXPT ID: 108584, 108585, 108526, 108527, 108559). For transcripts that 
had higher levels in the samples than the control, a "+" is shown. A "-" is shown for when 
transcript levels were reduced in root tips as compared to the control. For more experimental 
detail see the Example section below. 

NO genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

NO Genes Identified By Cluster Analyses Of Differential Expression 

NO Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of NO genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108584, 108585, 108526, 108527, 108559 
of the MA_difftable(s). 

NO Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of NO genes. A group in the MA_clust is 
considered a NO pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

NO Genes Identified By Amino Acid Sequence Similarity 

NO genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis NO genes. Groups 
of NO genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ED member of a NO pathway or network 
is a group of proteins that also exhibits NO functions/utilities. 
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Such nitric oxide responsive genes and gene products can function either to increase 
or dampen the above phenotypes or activities either in response to changes in nitric oxide 
concentration or in the absence of nitric oxide fluctuations. Further, promoters of nitric oxide 
responsive genes, as described in the Reference tables, for example, are useful to modulate 
transcription that is induced by nitric oxide or any of the following phenotypes or biological 
activities below. 

V.H.I. USE OF NITRIC OXIDE-RESPONSIVE GENES TO MODULATE 
PHENOTYPES : 

Nitric oxide responsive genes and gene products are useful to or modulate one or 
more of the following phenotypes: 

• Stress Responses 

Mediation of response to stresses 

• Disease resistance 

• Growth 
Roots 
Stems 

• Leaves 
. Cells 

• Promotes leaf cell elongation 

• Biomass 

• Fresh and Dry Weight during any time in plant life, such as at maturation; 

• Size and/or Weight 

• Flowers 
Seeds 

• Branches 
Leaves 

• Roots 

• Development 

• Seed Development 

• Dormancy 
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• Control rate and timing of germination 

• Prolongs seed storage and viability 
Senescence 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the nitric responsive genes when the desired sequence is 
operably linked to a promoter of a nitric responsive gene. 

To regulate any of the desired phenotype(s) above, activities of one or more of the 
nitric oxide responsive genes or gene products can be modulated and the plants tested by 
screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and Pelletier 
(1998) Methods. Mol. Biol. 82: 259-266 and/or screened for variants as described in Winkler 
et al. (1998) Plant Physiol. 118: 743-50 and visually inspected for the desired phenotype. 
Alternatively, plants can be metabolically and/or functionally assayed according to Beligni 
and Lamattina (2000) Planta 210: 215-21), Lapwood et al (1984) Plant Pathol 33: 13-20, 
and/or Brown and Botstein (1999) Nature Genet. 21: 33-37. 

V.H.2. USE OF NITRIC OXIDE-RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES: 

The activities of one or more of the nitric oxide responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Stress Response 


-Programmed Cell Death 

-Reactive Oxygen based Defence 
Pathways 


Levine et al (1996) Curr. 
Biol 6: 427-37 
Sellins and Cohen (1991) 
Radiat. Res. 126: 88-95 
Kumar and Klessig (2000) 
Mol. Plant Microbe 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 






METABOLIC ACTIVITIES 


ASSAYS 






AND/OR PATHWAYS 










Interact. 13:347-351 




Disease Resistance 


-Microbial Pathogen resistance 


Lapwood et al (1984) Plant 






pathways 


Pathol 33: 13-20 








Kumar and Klessig (2000) 








Mol. Plant microbe 








interact.13: 347-351 








Klessig et.al.(2000) Proc. 








Nat. Acad. Sci USA 97: 








8849-8855 








Delledonna et al(1998) 


o 

St J 






Nature 394: 585-588 


t 




-Programmed Cell Death 


Levineetal(1996) Curr. 


w 






Biol 6: 427-437 


*? ; 
J53S. 
%f : 






Sellins and Cohen (1991) 


S 

fi'i 






Radiat. Res. 126: 88-95 


n : 
Sir 




-Cellular Protectant Gene 


Brown and Botstein (1999) 


; ; 




expression 


Nat Genet 21: 33-37 






- Phytoalexin Biosynthesis 


Davis etal. (1993) 








Phytochemistry 32: 607- 








611 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Signal Transduction 


Regulation of hydrogen peroxide 


Wu et al. (1995) Plant Cell 




signaling 


7, 1357-1368 


Reorientation of nitrogen 


Induction of ribosomal proteins, 


This study. Standard 


metabolism 


asparagine synthesis, proteases, 


assays for detection of 




Rnases 


changes 


Reorientation of sugar and 


Induction of sugar transporters, 


This study. Standard 


energy metabolism 


ATPases, glycohydrolases, and 


assays for detection of 




glycolytic enzymes, for example 


changes 



Other biological activities that can be modulated by the NO responsive genes and 
gene products are listed in the Reference Tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

NO responsive genes are characteristically differentially transcribed in response to 
fluctuating NO levels or concentrations, whether internal or external to an organism or cell. 
The MA_diff table(s) report(s) the changes in transcript levels of various NO responsive 
genes in aerial tissues at 1 and 6 hours after a plant was sprayed with a Silwett L-77 solution 
enriched with 5 mM sodium nitroprusside, which is an NO donor. These changes are in 
comparison with plants sprayed with Silwett L-77 solution only. 

The data from this time course can be used to identify a number of types of NO 
responsive genes and gene products, including "early responders" and "delayed responders" 
Profiles of these different nitric oxide responsive genes are shown in the Table below 
together with examples of the kinds of associated biological activities. 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 


Upregulated genes 


Early responder 


- NO Perception 


-Transcription Factors 


(level at 1 hour = 6 


repressors to NO 


- NO Uptake 




hours) 




- Modulation of NO 


-Transporters 


(level at 1 hour > 6 




Response Transduction 


-Pathogen responsive 


hours) 




Pathways 


proteins, salicylic and 








jasmonate pathway 








proteins 






Specific Gene 


-Proteins to provide 






Transcription Initiation 


defence against active 






of Pathways to 


oxygen e.g. glutathione 






Optimize NO Response 


transferase,ascorbate 






Pathways 


free radical reductase, 








ascorbate peroxidase, 








nitrilase, heat shock 








proteins 








-Proteins to reorient 








metabolism 








e.g.proteases, Rnases, 








proteasomes, 








asparagine synthetase, 








glycohydrolases, 








transporters 








-Proteins to inhibit 








transport of nitric oxide 








-Degradation enzymes 


Upregulated 


Delayed NO 


- Maintenance of 


- NO Metabolic 


transcripts 


responders 


metabolism in presence 


Pathway enzymes 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 


(level at 1 hour < 6 




of High NO 


-Pathogen responsive 


hours) 




-Maintenace of disease 


proteins, salicylic and 






defence pathways 


jasmonate pathway 








proteins 






-Maintenance of 


-Proteins to provide 






pathways against 


defence against active 






reactive oxygen 


oxygen e.g. glutathione 






production 


transferase,ascorbate 








free radical reductase, 








ascorbate peroxidase, 








nitrilase, heat shock 








proteins 






Maintenance of 


-Proteins to reorient 






different metabolic 


and sustain metabolism 






programs 


e.g.proteases,Rnases,pr 








oteasomes,asparagine 








synthetase,glycohydrol 








ases,transporters, 








-Proteins to inhibit 








transport of NO 






Selective cell death 


-Degradation enzymes 


Down Regulated 


-Early responders of 


Negative regulation of 


-Transcription factors 


Transcripts 


NO utilization 


NO utilization 


-Kinases and 


(level at 1 hours = 6 


pathways 


pathways released 


phosphatases 


hours) 






-Chromatin 


(level at 6 hours > 1 






restructuring proteins 


hour) 


-Genes with 


Reorientation of 


- Transcription 




discontinued 


metabolism 


factors,metabolic 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVEL 


GENE 




ACTIVITY 




expression or 




enzymes, kinases and 




unsTable mRNA 




phosphatases, 




following nitric oxide 




transporters, nbosomal 




uptake 




proteins 






Programmed cell death 


-Most proteins in cells 








undergoing cell death 


Down Regulated 


-Delayed responder 


Negative regulation of 


Transcription factors 


Transcripts 


repressors of NO 


NO utilization 


-Kinases and 


(level at 1 hour > 6 


stress metabolism 


pathways released 


phosphatases 


hours) 






-Chromatin 








restructuring proteins 




-Genes with 


Reorientation of 


-Transcription 




discontinued 


metabolism 


factors,metabolic 




expression or 




enzymes, kinases and 




unsTable 




phosphatases, 




mRNA following 




transporters, ribosomal 




nitric oxide uptake 




proteins. 






Programmed cell death 


-Most proteins in cells 








undergoing 








programmed cell death 



Use of Promoters of NO Responsive Genes 

Promoters of NO responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the NO responsive genes 
where the desired sequence is operably linked to a promoter of a NO responsive gene. The 
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protein product of such a polynucleotide is usually synthesized in the same cells, in response 
to the same stimuli as the protein product of the gene from which the promoter was derived. 
Such promoter are also useful to produce antisense mRNAs to down-regulate the product of 
proteins, or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V J. OSMOTIC STRESS RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

The ability to endure and recover from osmotic and salt related stress is a major 
determinant of the geographical distribution and productivity of agricultural crops. Osmotic 
stress is a major component of stress imposed by saline soil and water deficit. Decreases in 
yield and crop failure frequently occur as a result of aberrant or transient environmental stress 
conditions even in areas considered suitable for the cultivation of a given species or cultivar. 
Only modest increases in the osmotic and salt tolerance of a crop species would have a 
dramatic impact on agricultural productivity. The development of genotypes with increased 
osmotic tolerance would provide a more reliable means to minimize crop losses and diminish 
the use of energy-costly practices to modify the soil environment. 

Changes in the osmotic concentration of the surrounding environment or within a 
plant results in modulation of many genes and gene products. Examples of such osmotic 
stress responsive genes and gene products, including salt responsive genes, are shown in the 
Reference, Sequence, Protein Group, Protein Group Matrix, MA_diff and MA_clust tables. 
These genes and/or products are responsible for effects on traits such as plant vigor and seed 
yield. 

While osmotic and/or salt stress responsive polynucleotides and gene products can act 
alone, combinations of these polynucleotides also affect growth and development. Useful 
combinations include different osmotic stress responsive polynucleotides and/or gene 
products that have similar transcription profiles or similar biological activities, and members 
of the same or similar biochemical pathways. In addition, the combination of an osmotic 
stress responsive polynucleotide and/or gene product with another environmentally 
responsive polynucleotide is also useful because of the interactions that exist between 
hormone regulated pathways, stress pathways, nutritional pathways and development. Here, 
in addition to polynucleotides having similar transcription profiles and/or biological 
activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in a common pathway. 

Such osmotic and/or salt stress responsive genes and gene products can function to 
either increase or dampen the above phenotypes or activities either in response to changes in 
osmotic concentration or in the absence of osmotic fluctuations. The MA_diff Table(s) 
reports the transcript levels of the experiment (see EXPT ID: 108570, 108571, 108541, 
108542, 108553, 108539, 108540). For transcripts that had higher levels in the samples than 
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the control, a is shown. A "-"is shown for when transcript levels were reduced in root 
tips as compared to the control. For more experimental detail see the Example section below. 

Osmotic Stress genes are those sequences that showed differential expression as 
compared to controls, namely those sequences identified in the MA_diff tables with a "+" or 
"-" indication 

Osmotic Stress Genes Identified By Cluster Analyses Of Differential Expression 

Osmotic Stress Genes Identified By Correlation To Genes That Are 
Differentially Expressed 

As described above, the transcription profiles of genes that act together are well 
correlated. Applicants not only have identified the genes that are differentially expressed in 
the microarray experiments, but also have identified the genes that act in concert with them. 
The MA_clust table indicates groups of genes that have well correlated transcription profiles 
and therefore participate in the same pathway or network. 

A pathway or network of Osmotic Stress genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID 108570, 108571, 108541, 108542, 
108553, 108539, 108540 of the MA_diff table(s). 

Osmotic Stress Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations can be 
merged to identify pathways or networks of Osmotic Stress genes. A group in the MA_clust 
is considered a Osmotic Stress pathway or network if the group comprises a cDNA ID that 
also appears in Knock-in or Knock-out tables that causes one or more of the pheno types 
described in section above. 

Osmotic Stress Genes Identified By Amino Acid Sequence Similarity 
Osmotic Stress genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Osmotic Stress 
genes. Groups of Osmotic Stress genes are identified in the Protein Group table. In this 
table, any protein group that comprises a peptide ID that corresponds to a cDNA ID member 
of a Osmotic Stress pathway or network is a group of proteins that also exhibits Osmotic 
Stress functions/utilities. 
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Further, promoters of osmotic stress responsive genes, as described in the Reference 
tables, for example, are useful to modulate transcription that is induced by osmotic stress or any 
of the following phenotypes or biological activities below. 

5 

V.I. 1 . USE OF OSMOTIC STRESS RESPONSIVE GENES TO 
MODULATE PHENOTYPES 
Osmotic stress responsive genes and gene products are useful to or modulate one or 
more of the following phenotypes: 
10 - Growth 

- Roots 
Stems 

- Leaves 

O - Development 

Is - Cell Growth 

% - DNA Synthesis and Cell Division 

W - Seed Development 

fh - Desiccation tolerance 

L - Dormancy 

W.0 - Control rate of Germination 

ax : 

j* - Prolongs Seed Storage and Viability 

^ - Senescence 

s 

Stress Responses 

- Desiccation 
25 - Drought 

- Salt 

To regulate any of the phenotype(s) above, activities of one or more of the osmotic 
stress responsive genes or gene products can be modulated and the plants tested by screening 
30 for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
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118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to de Castro (1998, Phytochemistry 47: 689-694), Xu (1998, J 
Exp Bot 49: 573-582), Ausubel et al. (In: Current Protocols in Molecular Biology (1999) 
Volume 1, chapter 4, eds. Ausubel, Brent, Kingston, Moore, Seidman, Smith and Struhl, New 
York, NY) and De Castro et al. (2000, Plant Physiol 122: 327-36) 



V .1.2. USE OF OSMOTIC STRESS RESPONSIVE GENES TO 
MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the osmotic stress responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 


Cell Growth And 
Differentiation 


Regulation Of Osmolyte 
Synthesis 


Yoshu et al. (1995) 
The Plant Journal 
7:751-60 




Regulation Of Glycolate Pathway 
And Photoinhibition Of 
Photosystem II In Response To 
Stress 


Strebet al. (1993) 
Physiologia 
Plantarum. 88:590- 
598 


Gene Regulation 


Transcriptional Regulation Of 
Osmotic Stress Induced Proteins 
Through DNA Binding Proteins 


Current Protocols in 
Molecular Biology / edited 
by Frederick M. Ausubel .. 
[et al.]. New York : 
Published by Greene Pub. 
Associates and Wiley- 
Interscience : J. Wiley, 
cl987 




Transcriptional Regulation Of 
Osmotic Stress Induced Proteins 


Jonak (1996) Proceedings 
of the National Academy of 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS 






METABOLIC ACTIVITIES 


INCLUDING 






AND/OR PATHWAYS 


ASSAYS 






Through Protein Phosphorylation 


Sciences of the United 






And Dephosphorylation 


States of America, 93: 








11274-11279; 








Monroy, A.et al., 








(1998) 








Analytical Biochemistry 








265: 183-185; 






Regulation Of Osmotic Stress 


McCright(1998) IN: 






Induced Gene Protein 


Methods in Molecular 






Accumulation By Protein Protein 


Biology; Protein 


£ — V 




Intereaction Between Osmotic 


phosphatase protocols; 


iO 




Stress Regulated Genes And 


Ludlow (1998) Humana 


f"~ 
PI 




Protein Phosphatase 2C 


Press Inc. ; Suite 808, 999 


w 






Riverview Drive, Totowa, 


m 

«r - 






New Jersey 


s 

Q 






07512, USA. :263-277. 






Transcriptional Regulation Of 


Luo and Dean (1999) 






Heat Induced Genes Through 


Journal of the 






Chromatin Remodeling 


National Cancer 








Institute 91: 1288- 








1294; 








Chromatin 








protocols (1999) 








edited by Peter B. 








Becker. Totowa, 








NJ. : Humana 








Press 






Activity Of Abcisic Acid 


Gubler et al. (1999) 






Regulated DNA Binding Proteins 


Plant Journal 17: 1- 
9 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 




Accumulation Of RNA Binding 
Proteins That Regulate Osmotic 
Stress 


Sato (1995) 
Nucleic Acids Research 23: 
2161-2167. 


Stress Response 


Synthesis And Metabolism Of 
Osmoprotectants Such As 
Betaine, Proline And Trehalase 


Minocha et al. 
(1999) Plant 
Physiol and 
Biochem 37: 597- 
603 




Regulation Of Sugar Transporters 


Dejardin et al. 
(1999) Biochem J; 
344 Pt 2:503-9 




Regulation Of Vacuolar 
Sodium/Proton Antiport Activity 
And The Detoxification Of 
Cations 


Gaxiola et al. 
(1999) PNAS USA 
96: 1480-1485 




Regulation Of Intracellular Na+ 
And Li+ Ion Concentrations 


Espinoza-Ruiz et 
al.(1999) The Plant 
Journal 20: 529-539 




Regulation Of Universal Stress 
Protein Homologue Activity By 
Phosphorylation And 
Dephosphorylation. 


Freestone et al. 
(1997) Journal of 
Molecular Biology, 
v. 274:318-324 




Regulation/Maintenance Of 
Protein Stability During Thermal 
Stress 


Walker (1996) 
Humana Press Inc. 
Suite 808, 999 
Riverview Drive, 
Totowa, New 
Jersey 07512, USA 




Regulation Of Protein 
Degradation During Thermal 


Vierstra(1996) Plant 
Molecular Biology,32:275- 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS 

INCLUDING 

ASSAYS 




Stress. 


302. 

Vierstra and Callis 
(1999) Plant 
Molecular Biology, 
41:435-442 


Signal Transduction 


Activation Of Stress Response 
Genes 


Xinong et al. 
(1999) The Plant 
Journal 19: 569-578 




Salt Tolerance 


Piao(1999) Plant 
Physiol 19: 1527- 
1534 




Calcium Mediated Stress 
Response 


Subbaiah et al. 
(1994) Plant 
Physiology 
105:369-376 
Kudlaetal. (1999) 
PNAS USA 96: 
4718-4723 



Other biological activities that can be modulated by the osmotic stress responsive 
genes and gene products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 

Osmotic stress responsive genes are characteristically differentially transcribed in 
response to fluctuating osmotic stress levels or concentrations, whether internal or external to 
an organism or cell. MA_diff table reports the changes in transcript levels of various osmotic 
stress responsive genes in aerial tissues of plants at 1 and 6 hours after the plants were 
sprayed with Hoagland's solution containing 20% PEG as compared to aerial tissues from 
plants sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of 
osmotic stress responsive genes and gene products, including "early responding," "sustained 
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osmotic stress responders," "repressors of osmotic stress pathways" and "osmotic stress 
responders." Profiles of these different osmotic stress responsive genes are shown in the 
Table below together with examples of the kinds of associated biological activities. 



GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY OF 


CONSEQUENCES 


BIOCHEMICAL 


LEVELS 


GENES 




ACTIVITIES OF 








GENE PRODUCTS 


Up Regulated 


Early Responders 


• Osmotic Stress 


• Transcription 


Transcripts 


To Osmotic 


Perception 


Factors 


(Level At 1 Hour = 


Stress 


Osmolyte Uptake 


• Transcription 


6 Hours) 




• Modulation Of 


Coactivators 


(Level At 1 Hour > 


Universal Stress 


Osmotic Stress 


Membrane 


6 Hours) 


Response Genes 


Response Signal 


Transporters 






Transduction Pathways 


Proline 




Osmotic Stress 


Specific Gene 


Biosynthesis 




Responders 


Transcription 


Selective Inhibition 






Initiation 


Of Osmolyte 




• Abscisic Acid 


Specific Gene 


Transport 




Biosynthesis And 


Transcription 


• Protein 




Perception 


Repression 


Ubiquitination 






• Translation Activation 


♦ Protein 






• Translation 


Degradation 






Repression 


• Rna Binding 






• Repression Of 


Proteins 






"Normal State" 


Modification Of 






Pathways To Optimize 


Protein Activity By 






Osmotic Stress 


Phosphatases, 






Response 


Kinases 






• Activation Of Stress 


• Synthesis And Or 






Signaling Pathways 


Activation Of 






• Up Regulation Of 


Oxide Hydrolases, 






Abscisic Acid 


Suoeroxidedismuta 






Biosynthesis Pathway 


se, Iron Ascorbate 
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Protein Accumulation 


Peroxidase 






And Activity 


Activation Of 






Scavenging Reactive 


Signaling Pathway 






Oxygen Species 


By Calcium 






• Modification Of Cell 


Binding Proteins, 






Wall Composition 


• Modification Of 






• Up-Regulation Of 


Protein Activity By 






Universal Stress 


Protein-Protein 






Response Protein 


Interaction 






Accumulation 


Change In 








Chromatin 








Structure And/Or 








Localized Dna 








Topology 








Modification Of 








Pre-Existing 








Translation Factors 








By Phosphorylation 








(Kinases) Or 








Dephosphorylation 








(Phosphatases) 








• Synthesis Of New 








Translation Factors 








• Abscisic Acid 








Biosynthesis 


Up Regulated 


Sustained 


• Osmolyte Adjustment 


Osmotic Stress 


Transcripts 


Osmotic Stress 


And Adaptation 


Metabolic 


(Level At 1 Hr < 6 


Responders 


• Photosynthetic 


Pathways 


Hr) 




Activity Modification 


• Sugar Biosynthetic 




Repressor Of 


• Activation Of 


Pathways 




Osmotic Stress 


"Normal State" 


Sugar Transporters 




Pathways 


Biosynthesis Genes 


. Transcription 






Negative Regulation 


Factors 
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• Abscisic Acid 


Of Osmotic Stress 


Transcription 




Perception, 


Pathways 


Coactivators 




Biosynthesis And 


• Negative Regulation 


Membrane 




Regulation 


Of Abscisic Acid 


Transporters 






Biosynthesis 


Abscisic Acid 






• Acivation Of Abscisic 


Biosynthesis 






Acid Degradation 








Pathway 








• Cell Wall 








Composition 








Modification 




Down-Regulated 


Early Responder 


• Metabolic Repression 


Transcription 


Transcripts 


Repressors Of 


• Specific Gene 


Factors 


(Level At 1 Hr « 6 


"Normal" State Of 


Transcription 


Transcription 


Hr) 


Metabolism 


Initiation 


Coactivators 


(Level At 6 Hr > 1 




• Specific Gene 


Protein 


Hr) 


Negative Regulators 


Transcription 


Degradation 




Of Abscisic Acid 


Repression 


• Rna Binding 




Biosynthesis And 


• Translation Activation 


Proteins 




Perception. 


Translation 


• Modification Of 






Repression 


Protein Activity By 




Positive Regulators 


• Abscisic Acid 


Phosphatases, 




Of "Normal State" 


Degradation 


Kinases 




Metabolic Pathways. 


• Protein Degradation 


• Activation Of 








Signaling Pathway 








By Calcium 








Binding Proteins, 








Modification Of 








Protein Activity By 








Protein-Protein 








Interaction 








Change In 








Chromatin 
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Structure And/Or 
Localized Dna 
Topology 
Modification Of 
Pre-Existing 
Translation Factors 
By Phosphorylation 
(Kinases) Or 
Dephosphorylation 
(Phosphatases) 
Synthesis Of New 
Translation Factors 


Down-Regulated 


Repressors Of 


• Osmotic Stress 


Transcription 


Transcripts 


"Normal" State Of 


Adaptation 


Factors 


(Level At 1 Hr > 6 


Metabolism 


• Negative Regulation 


♦ Transcription 


Hr) 




Of Abscisic Acid 


Coactivators 




Genes With 


Biosynthesis 


• Protein 




Discontinued 


• Negative Regulation 


Degradation 




Expression Or 


Of Osmotic Stress 


• Rna Binding 




UnsTable mRNA In 


Response Pathways 


Proteins 




Presence Of Osmotic 


Genes 


• Modification Of 




Stress 


Osmolyte Synthesis 


Protein Activity By 






And Osmolyte 


Phosphatases, 




Repressor Of 


Cellular Partitioning 


Kinases 




Osmotic Stress 


Readjustment 


Activation Of 




Pathways 


• Activation Of 


Signaling Pathway 






"Normal State" 


By Calcium 




Repressors Of 


Metabolic Pathways 


Binding Proteins, 




Abscisic Acid 




• Modification Of 




Biosynthesis, 




Protein Activity By 




Perception And 




Protein-Protein 




Regulation 




Interaction 








• Change In 
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Chromatin 
Structure And/Or 
Localized Dna 
Topology 

• Modification Of 
Pre-Existing 
Translation Factors 
By Phosphorylation 
(Kinases) Or 
Dephosphorylation 
(Phosphatases) 
Synthesis Of New 
Translation Factors 

• Sugar Biosynthetic 
Pathways 

. Sugar Transporters 



Further, any desired sequence can be transcribed in similar temporal, tissue, or 
enviromentally specific patterns as the osmotic stress responsive genes when the desired 
sequence is operably linked to a promoter of an osmotic stress responsive gene. 
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V.J. ALUMINUM RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Aluminum is toxic to plants in soluble form (Al 3+ ). Plants grown under aluminum 
stress have inhibited root growth and function due to reduced cell elongation, inhibited cell 
division and metabolic interference. As an example, protein inactivation frequently results 
from displacement of the Mg2+ cofactor with aluminum. These types of consequences result 
in poor nutrient and water uptake. In addition, because stress perception and response occur 
in the root apex, aluminum exposure leads to the release of organic acids, such as citrate, 
from the root as the plant attempts to prevent aluminum uptake. 

The ability to endure soluble aluminum is a major determinant of the geographical 
distribution and productivity of agricultural crops. Decreases in yield and crop failure 
frequently occur as a result of aberrant, hot conditions even in areas considered suiTable for 
the cultivation of a given species or cultivar. Only modest increases in the aluminum 
tolerance of crop species would have a dramatic impact on agricultural productivity. The 
development of genotypes with increased aluminum tolerance would provide a more reliable 
means to minimize crop losses and diminish the use of costly practices to modify the 
environment. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two 
differentially labeled fluorescent cDNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing 10,000 non-redundant ESTs, selected from 37,000 
randomly sequenced ESTs generated from mRNA of different tissues and developmental 
stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and identical Ceres clones identified. MA_diff table reports the results of this 
analysis, indicating those Ceres clones which are up or down regulated over controls, thereby 
indicating the Ceres clones which are aluminum response responsive genes. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ED: 
Aluminum (relating to SMD 7304, SMD 7305)). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
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reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Aluminum genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a or "-" 
indication 

Aluminum Genes Identified By Cluster Analyses Of Differential Expression 

Aluminum Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Aluminum genes is any group in the MA_clust that 
comprises a cDNA ED that also appears in Expt ID Aluminum (relating to SMD 7304, SMD 
7305) of the MA_diff table(s). 

Aluminum Genes Identified By Correlation To Genes That Cause 
Physiological Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Aluminum genes. A group in the 
MA_clust is considered a Aluminum pathway or network if the group comprises a cDNA ID 
that also appears in Knock-in or Knock-out tables that causes one or more of the phenotypes 
described in section above. 

Aluminum Genes Identified By Amino Acid Sequence Similarity 

Aluminum genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Aluminum 
genes. Groups of Aluminum genes are identified in the Protein Group table. In this table, 
any protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
Aluminum pathway or network is a group of proteins that also exhibits Aluminum 
functions/utilities. 
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V.J.I. USE OF ALUMINUM RESPONSE GENES TO MODULATE 
PHENOTYPES 

Changes in aluminum concentrations in a plant's surrounding environment 
results in modulation of many genes and gene products. Examples of such aluminum 
response genes and gene products are shown in the Reference and Sequence Tables. 
These genes and/or products are responsible for effects on traits such as plant vigor 
and seed yield. 

While aluminum responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different aluminum responsive polynucleotides and/or gene 
products that have similar transcription profiles or similar biological activities, and 
members of the same or similar biochemical pathways. In addition, the combination 
of a aluminum responsive polynucleotide and/or gene product with another 
environmentally responsive polynucleotide is also useful because of the interactions 
that exist between hormone regulated pathways, stress pathways, nutritional pathways 
and development. Here, in addition to polynucleotides having similar transcription 
profiles and/or biological activities, useful combinations include polynucleotides that 
may have different transcription profiles but which participate in a common pathway. 

Such aluminum responsive genes and gene products can function to either 
increase or dampen the above phenotypes or activities either 

• in response to changes in aluminum concentration or 

• in the absence of aluminum fluctuations. 

More specifically, aluminum responsive genes and gene products are useful to 
or modulate one or more of the following phenotypes: 
Growth 

- Roots 

Inhibition of root elongation 

- Stems 

- Leaves 

- Whole Plant 

- Development 
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- Cell Growth 

Elongation 
- Division 

- Mediates response to oxidative stress, calcium-mediated defense, 
antioxidant defense and pathogenesis 



To produce the desired phenotype(s) above, one or more of the aluminum 
response genes or gene products can be tested by screening for the desired trait. 
Specifically, the gene, mRNA levels, or protein levels can be altered in a plant 
utilizing the procedures described herein and the phenotypes can be assayed. As an 
example, a plant can be transformed according to Bechtold and Pelletier (1998, 
Methods. Mol. Biol. 82:259-266) and visually inspected for the desired phenotype or 
metabolically and/or functionally assayed according to Li and Fleming (1999, FEBS 
Lett 461 : 1-5), Delhaize et al. (1999, J Biol Chem 274: 7082-8), Sigimoto and 
Sakamoto (1997, Genes Genet Syst 72: 311-6), Esaki et al. (2000, Plant Physiol 122: 
657-65), Leonard and Gerber (1988, Mutat Res 196: 247-57), Baisakhi et al. (2000, 
Mutat Res 465: 1-9), Ma (2000, Plant Cell Physiol 41 : 383-90) and Koyama et al 
(1999, Plant Cell 40: 482-8) 

Alternatively, the activities of one or more of the aluminum responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included in the 
Table below: 



GENERAL 
CATEGORY 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


ASSAY 


Cell Growth and 
Development 


-Phospholipase D (PLD) 
activity 

-Regulation of 
Phosphtidylserine 


Toda et al. (1999) 
Biosci Biotechnol 
Biochem 63: 210- 
212 
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Synthase (PSS) 






-Cell wall strengthening 


Hamel et al. (1998) 
Planta205: 531-38 


Stress Response 


-Regulation of oxidative 
stress 


Esaki et al. (2000) 
Plant Physiol 122: 
657-655 




-Regulation of 
antioxidant defense and 
DNA repair 


Baisakhi et al. 

/r\ r\r\(W Ajf,,4-«+ T) 

(2000) Mutat Res 
465: 1-9 




-Secretion of Organic 
Acids (e.g. maleate, 
citrate) from root apex 


Koyama et al. 
(1999) Plant Cell 
40: 482-8 




-Ca2+mediated Defense 
Responses Against Low 
pH 


Plieth et al. (1999) 
Plant J 18: 634-50 


Signaling 


-H+ transport 
-Auxin transport 


Degenhardt et al. 
(1988) Plant Physil 
117: 19-27 
Rashotte et al. 
(2000) Plant 
Physiol 122: 481- 
90 



Other biological activities that can be modulated by aluminum response genes and 
their products are listed in the REFERENCE Table. Assays for detecting such biological 
activities are described in the Protein Domain table. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


responders to 


• Aluminum 


• Transporters 


transcripts 


aluminum 


perception 


• Metabolic enzymes 




application 


Aluminum uptake 


• Change in cell 






and transport 


membrane structure 






Aluminum 


and potential 






metabolism 


• Kinases and 






Synthesis of 


phosphatases 






secondary 


• Transcription 






metabolites and/or 


activators 






proteins 


Change in chromatin 








structure and/or 






• Modulation of 


localized DNA 






aluminum 


topology 






response 


• 






transduction 








pathways 








Specific gene 








transcription 








initiation 




Down-regulated 


responder to 


• Negative 


• Transcription factors 


transcripts 


aluminum 


regulation of 


Change in protein 




repressors of 


aluminum 


structure by 




aluminum state of 


pathways 


phosphorylation 




metabolism 




(kinases) or 






♦ Changes in 


dephosphorylation 




Genes with 


pathways and 


(phosphatases) 




discontinued 


processes 


• Change in chromatin 




expression or 


operating in cells 


structure and/or DNA 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 




unsTable mRNA in 
presence of aluminum 


Changes in other 
metabolisms than 
aluminum 


topology 

Stability of factors for 
protein synthesis and 
degradation 
• Metabolic enzymes 



Use of Promoters of Aluminum Responsive Genes 

Promoters of Aluminum responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Aluminum 
responsive genes where the desired sequence is operably linked to a promoter of a Aluminum 
responsive gene. The protein product of such a polynucleotide is usually synthesized in the 
same cells, in response to the same stimuli as the protein product of the gene from which the 
promoter was derived. Such promoter are also useful to produce antisense mRNAs to down- 
regulate the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via 
sense suppression. 
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V.K. CADMIUM RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Cadmium (Cd) has both toxic and non-toxic effects on plants. Plants exposed to non- 
toxic concentrations of cadmium are blocked for viral disease due to the inhibition of 
systemic movement of the virus. Surprisingly, higher, toxic levels of Cd do not inhibit viral 
systemic movement, suggesting that cellular factors that interfere with the viral movement 
are triggered by non-toxic Cd concentrations but repressed in high Cd concentrations. 
Furthermore, exposure to non-toxic Cd levels appears to reverse posttranslational gene 
silencing, an inherent plant defense mechanism. Consequently, exploring the effects of Cd 
exposure has potential for advances in plant disease control in addition to soil bio- 
remediation and the improvement of plant performance in agriculture. 

Changes in cadmium concentrations in a plant's surrounding environment results in 
modulation of many genes and gene products. Microarray technology allows monitoring of 
gene expression levels for thousands of genes in a single experiment. This is achieved by 
simultaneously hybridizing two differentially labeled fluorescent cDNA pools to glass slides 
that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US Arabidopsis 
Functional Genomics Consortium (AFGC) has recently made public the results from such 
microarray experiments conducted with AFGC chips containing some 10,000 non-redundant 
ESTs, selected from about 37,000 randomly sequenced ESTs generated from mRNA of 
different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
treated with 10 jaM cadmium compared with untreated plants were identified, compared to 
the Ceres full length cDNA and genomic sequence databanks, and the equivalent Ceres 
clones identified. The MA_diff table(s) report(s) the results of this analysis, indicating those 
Ceres clones which are up or down regulated over controls, thereby indicating the Ceres 
clones which represent cadmium responsive genes. 

Examples of such cadmium responsive genes and gene products are shown in the 
Reference and Sequence Tables. These genes and/or products are responsible for effects on 
traits such as plant vigor and seed yield. 

While cadmium responsive polynucleotides and gene products can act alone, 
combinations of these polynucleotides also affect growth and development. Useful 
combinations include different cadmium responsive polynucleotides and/or gene products 
that have similar transcription profiles or similar biological activities, and members of the 
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same or similar biochemical pathways. Whole pathways or segments of pathways are 
controlled by transcription factor proteins and proteins controlling the activity of signal 
transduction pathways. Therefore, manipulation of such protein levels is especially useful for 
altering phenotypes and biochemical activities of plants. In addition, the combination of a 
cadmium responsive polynucleotide and/or gene product with other environmentally 
responsive polynucleotides is also useful because of the interactions that exist between, for 
example, stress and pathogen induced pathways, nutritional pathways and development. 
Here, in addition to polynucleotides having similar transcription profiles and/or biological 
activities, useful combinations include polynucleotides that may have different transcription 
profiles but which participate in common or overlapping pathways. 

The MAdiff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
Cadium (relating to SMD 7427, SMD 7428)). For transcripts that had higher levels in the 
samples than the control, a is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Cadium genes are those sequences that showed differential expression as compared 
to controls, namely those sequences identified in the MA_diff tables with a "+" or "-" 
indication 

Cadium Genes Identified By Cluster Analyses Of Differential Expression 

Cadium Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Cadium genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Cadium (relating to SMD 7427, SMD 
7428) of the MA_diff table(s). 
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Cadium Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
5 can be merged to identify pathways or networks of Cadium genes. A group in the MA_clust 
is considered a Cadium pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 

10 Cadium Genes Identified By Amino Acid Sequence Similarity 

Cadium genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Cadium 
genes. Groups of Cadium genes are identified in the Protein Group table. In this table, any 
p protein group that comprises a peptide ID that corresponds to a cDNA ID member of a 
:E5 Cadium pathway or network is a group of proteins that also exhibits Cadium 
4^ functions/utilities, 
i \j 

H; Such cadmium responsive genes and gene products can function to either increase or 

5 dampen phenotypes or activities either in response to changes in cadmium concentration or in 
gjo the absence of cadmium fluctuations. Further, promoters of cadmium responsive genes, as 
l Z described in the Reference tables, for example, are useful to modulate transcription that is 
O induced by cadmium or any of the following phenotypes or biological activities below. 

V.K.1. USE OF CADMIUM RESPONSIVE GENES, GENE COMPONENTS 
25 AND PRODUCTS TO MODULATE PHENOTYPES 

Cadmium responsive genes and gene products are useful to or modulate one or more 
of the following phenotypes: 

• Growth 
. Roots 

30 • Initiation and maintenance of cell division 

• Stems 
Leaves 

• Development 
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• Mitochondria 

• Post-embryonic root meristem development 
Senescence 

Stress Response 

. Modulation of Jasmonic Acid and other stress control pathways 

• Metabolic detoxification 

• Heavy metals 

- Plant and Seed Yield; Fruit Yield 

Further, any desired sequence can be transcribed in similar temporal, tissue, or 
environmentally specific patterns as the cadmium responsive genes when the desired 
sequence is operably linked to a promoter of a cadmium responsive gene. 

To regulate any of the phenotype(s) above, activities of one or more of the 
cadmium responsive genes or gene products can be modulated and tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and 
Pelletier (1998) Methods. Mol. Biol. 82:259-266) and/or screened for variants as in 
Winkler et al. (1998) Plant Physiol 118: 743-50 and visually inspected for the desired 
phenotype or metabolically and/or functionally assayed according to Ghoshroy et al. 
(1998, Plant J 13: 591-602), Citovsky et al. (1998, Plant J 16: 13-20), Clemens et al. 
(1999, EMBO J 18: 3325-33), Chen et al. (2000, Chemosphere 41 : 229-34), Xian and 
Oliver (1998, Plant Cell 10: 1539-90), Romero-Peurtas et al. (1999, Free Rad Res 31: 
S25-31), Gaur and Noraho (1995, Biomed Environ Sci 8: 202-10), Thomine et al. 
(2000, PNAS USA 97: 4991-6), Howden et al. (1995, Plant Physiol 107: 1067-73), 
Kesseler and Brand (1994, Eur J Biochem 225: 907-22) and Vernoux et al. (2000, 
Plant Cell 12:97-110). 

V.K.2. USE OF CADMIUM-RESPONSIVE GENES, GENE 
COMPONENTS AND PRODUCTS TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the cadmium responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
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those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , 

Differentiation and 
Development 


Root Growth 

Initiation and maintenance of 
cell division 
Resistance to Cadmium- 
inhibition of root growth 


Thomine et al. (2000) PNAS 
USA 97: 4991-6 
Vernoux et al. (2000) Plant Cell 
12: 97-110 


Metabolism 


Cadmium sensing 


Howden et al. (1995) Plant 
Physiol 107: 1067-73 




Cadmium uptake and 
transport 

Decreased cadmium 
transport 
Phytoremediation 


Gaur and Noraho (1995) Biomed 
Environ Sci 8: 202-10 
Thomine et al. (2000) PNAS 
USA 97: 4991-6 




Inhibition of oxidative 
phophorylation 


Kesseler and Brand (1994) Eur. 
Biochem 225 : 907-22 


Plant Defenses 


Viral resistance 
Inhibition of systemic 
movement of virus 
Block of viral disease 


Ghoshroy et al. (1998) Plant J 
13: 591-602 




Detoxification of heavy 
metals 


Clemens et al. (1999) EMBO J 
1_8: 3325-33 




Enhanced stress resistance 


Romero-Peurtas et al. (1999) 
Free Rad Res 31: S25-31 




Cadmium resistance 

via modulation of jasmonic 

acid signaling pathway 


Xiang and Oliver (1998) Plant 
Cell 10: 1539-90 


Signaling 


Relief of post-translational 


Citovsky et al. (1998) Plant J 16: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




gene silencing 


13-20 



Other biological activities that can be modulated by the cadmium responsive genes 
and gene products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Cadmium responsive genes are characteristically differentially transcribed in response 
to fluctuating cadmium levels or concentrations, whether internal or external to an organism 
or cell The MA_diff table(s) report(s) the changes in transcript levels of various cadmium 
responsive genes following treatment with 10 \xM cadmium, relative to untreated plants. 
Profiles of some cadmium responsive genes are shown in the Table below together with 
examples of the kinds of associated biological activities. 



TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to 


Cadmium perception 


Transporters 


transcripts 


cadmium 


Cadmium uptake and 


Metabolic enzymes 




Application 


transport 


Change in cell membrane 






Cadmium metabolism 


structure and potential 




Genes induced by 


Synthesis of secondary 


Kinases and 




cadmium 


metabolites and/or 


Phosphatases 






proteins 


Transcription activators 






Modulation of 


Change in chromatin 






cadmium response 


structure and/or localized 






transduction pathways 


DNA topology 






Specific gene 


RNA binding proteins 






transcription initiation 








Genes involved in 








inhibiting systemic 








movement of plant 








viral RNA 
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TRANSCRIPT 


TYPE OF 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 


GENES 


CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 






Genes involved in post 








translational gene 








silencing 




Down-regulated 


Responders to 


Negative regulation of 


Transcription factors 


transcripts 


cadmium 


cadmium pathways 


Change in protein 






released 


structure by 




Genes repressed 




phosphorylation 




by cadmium 


Changes in pathways 


(kinases) or 






and processes operating 


Dephosphoryaltion 




Genes with 


in cells 


(phosphatases) 




discontinued 


Changes in metabolism 


Change in chromatin 




expression or 


other than cadmium 


structure and/or DNA 




unsTable mRNA 


pathways 


topology 




in presence of 


Genes involved in 


Factors for protein 




cadmium 


facilitating systemic 


synthesis and degradation 






movement of plant 


Metabolic enzymes 






viral RNA 


RNA binding proteins 






Genes involved in 








promoting post 








translational gene 








silencing 





Use of Promoters of Cadmium Responsive Genes 

Promoters of Cadmium responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Cadmium responsive 
genes where the desired sequence is operably linked to a promoter of a Cadmium responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
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the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.L. DISEASE RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress conditions, 
including pathogen attack. To combat such conditions, plant cells deploy a battery of inducible 
5 defense responses, including the triggering of an oxidative burst and the transcription of 

pathogenesis-related protein (PR protein) genes. These responses depend on the recognition of a 
microbial avirulence gene product (avr) by a plant resistance gene product (R), and a series of 
downstream signaling events leading to transcription- independent and transcription-dependent 
disease resistance responses. Reactive oxygen species (ROS) such as H2O2 and NO from the 
10 oxidative burst plays a signaling role, including initiation of the hypersensitive response (HR) 
and induction of systemic acquired resistance (SAR) to secondary infection by unrelated 
pathogens. PR proteins are able to degrade the cell walls of invading microorganisms, and 
phytoalexins are directly microbicidal. 
C The presence of an ayirulent pathogen and/or changes in the concentrations of 0 2 ", 

J5 H2O2 and NO in the environment surrounding a plant cell modulate the activities of many 
£ genes and, therefore, the levels of many gene products. Examples of tobacco mosaic virus 
Rj (TMV) responsive genes and gene products, many of them operating through an ROS 

m 

ff ; : signaling system, are shown in The Reference and Sequence Tables. These genes and/or 
L products are responsible for effects on traits such as plant vigor and seed yield. The genes 
CS0 were discovered and characterized from a much larger set by experiments designed to find 
j; genes whose mRNA products changed in response to application of TMV to plants. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
25 Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results 
from such microarray experiments conducted with AFGC chips containing some 10,000 non- 
redundant ESTs, selected from about 37,000 randomly sequenced ESTs generated from 
mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in 
30 response to TMV infection over the non infected controls were identified, compared to the 
Ceres full length cDNA and genomic sequence databanks, and equivalent Ceres clones 
identified. The MA_diff table(s) report(s) the results of this analysis, indicating those Ceres 
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clones which are up or down regulated over controls, thereby indicating the Ceres clones 
which represent disease responsive genes. 

Manipulation of one or more disease responsive gene activities are useful to modulate 
the biological processes and/or phenotypes listed below. Disease responsive genes and gene 
products can act alone or in combination. Useful combinations include disease responsive 
genes and/or gene products with similar transcription profiles, similar biological activities, or 
members of the same or functionally related biochemical pathways. Whole pathways or 
segments of pathways are controlled by transcription factor proteins and proteins controlling 
the activity of signal transduction pathways. Therefore, manipulation of such protein levels is 
especially useful for altering phenotypes and biochemical activities of plants. 

Such disease responsive genes and gene products can function to either increase or 
dampen the above phenotypes or activities either in response to changes in active oxygen 
concentration or in the absence of active oxygen fluctuations. The MA_diff Table(s) reports 
the transcript levels of the experiment (see EXPT ID: Disease (relating to SMD 7342, SMD 
7343)). For transcripts that had higher levels in the samples than the control, a "+" is shown. 
A "-" is shown for when transcript levels were reduced in root tips as compared to the 
control. For more experimental detail see the Example section below. 

Disease genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Disease Genes Identified By Cluster Analyses Of Differential Expression 

Disease Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Disease genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Disease (relating to SMD 7342, SMD 
7343) of the MA_diff table(s). 
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Disease Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Disease genes. A group in the MA_clust 
is considered a Disease pathway or network if the group comprises a cDNA ED that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 

Disease Genes Identified By Amino Acid Sequence Similarity 

Disease genes from other plant species typically encode polypeptides that 
share amino acid similarity to the sequences encoded by corn and Arabidopsis Disease genes. 
Groups of Disease genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Disease 
pathway or network is a group of proteins that also exhibits Disease functions/utilities. 

Further, promoters of disease responsive genes, as described in the Reference tables, 
for example, are useful to modulate transcription that is induced by disease or any of the 
following phenotypes or biological activities below. Further, any desired sequence can be 
transcribed in similar temporal, tissue, or enviromentally specific patterns as the disease 
responsive genes when the desired sequence is operably linked to a promoter of a disease 
responsive gene. 

V.L.I. USE OF DISEASE RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE PHENOTYPES 
Disease responsive genes and gene products are useful to or modulate one or more of 
the following phenotypes: 

• Pathogen Tolerance and/or Resistance 

• Avr/R locus interactions 

• Non-Host interactions 
. HR 

. SAR 

Resistance to bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, 
Pseudomonas tabaci, Stuart's wilt, etc. 
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Resistance to fungi e.g. to downy mildews such as Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; rusts such as Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; 
and to other fungal diseases e.g. Cercospora zeae-maydis, Colletotrichum 
graminicola, Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, 
Phytophthora parasitica, Peronospora tabacina, Septoria, etc.; 
Resistance to viruses or viroids e.g. to tobacco or cucumber mosaic virus, ringspot 
virus, necrosis virus, pelargonium leaf curl virus, red clover mottle virus, tomato 
bushy stunt virus, and like viruses; 

Resistance to insecs, such as to aphids e.g. Myzus persicae; to beetles and beetle 

larvae; to lepidoptera larvae e.g. Heliothus etc. 

Resistance to Nematodes, e.g. Meloidogyne incognita etc 

Local resistance in primary (infected) or secondary (uninfected) leaves 

Stress Tolerance 

Winter Survival 

Cold Tolerance 

Salt tolerance 

Heavy Metal Tolerance, such as Cadmium 
Tolerance to Physical Wounding; 

Increased Organelle Tolerance to Redox Stress, such as in Mitochondria, and 

chloroplasts 

Cell Death 

Programmed cell death, including death of diseased tissue and during senescence 

Fruit Drop 

Biomass 

Fresh and Dry Weight during any time in plant life, such as maturation 

Number of Flowers, Seeds, Branches, and/or Leaves 

Seed Yield, including Number, Size, Weight, and/or Harvest Index 

Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

Plant Development 

Time to Fruit Maturity 

Cell Wall Strengthening and Reinforcement - 
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• Plant Product Quality 

• Paper making quality 

• Food additives 

• Treatment of Indications modulated by Free Radicals 
5 • Cancer 

• Kinds of low molecular weight compounds such as phytoalexins 

• Abundance of low molecular weight compounds such as phytoalexins 
Other phenotypes based on gene silencing 

10 To regulate any of the phenotype(s) above, activities of one or more of the disease 

responsive genes or gene products can be modulated and the plants can be tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be screened for variants 
O as in Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed , for example, in accordance to 
Is Alvarez et al., (1998) Cell 92: 773-784; Halhbrock and Scheel, (1989) Ann. Rev. Plant Physiol. 
J Plant Mol. Biol. 40: 347-369; Lamb et al., (1997) Ann. Rev. Plant Mol. Biol. Plant Physiol. 48: 
nj 25 1-275; Lapwood et al. (1984) Plant Pathol. 33: 13-20; Levine et al. (1996) Curr. Biol. 6: 427- 
m 437; McKersie et al., (2000) Plant Physiol. 122: 1427-1437; Olson and Varner (1993) Plant J. 4: 
l h 887-892; Pastore et al., (2000), FEBS Lett 470: 88-92; Pastori et al., (1997) Plant Physiol. 113: 
§0 41 1-418; Romero-Puertas et al., (1999) Free Radic. Res. 199931 Suppl: S25-31; Shirataki et al., 
> Anticancer Res 20: 423-426 (2000); Wu et al., (1995) Plant Cell 7: 1357-1 368. 

TEST 

V.L.2. USE OF DISEASE RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 
25 The activities of one or more of the disease responsive genes can be 

modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities are documented and can be measured 
according to the citations above and included in the Table below: 





BIOCHEMICAL OR 




PROCESS 


METABOLIC ACTIVITIES 


CITATIONS INCLUDING 




AND/OR PATHWAYS 


ASSAYS 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Resistance to Pathogens 


Induction of ROS signaling 
pathways 


Wu et.al.(1995) Plant Cell 7: 
1357-68 




Modulation of nitric oxide 
signaling 


Delledonne et al. (1998) Nature 
394: 585-588 




Induction of PR proteins, 
phytoalexins, and defense 
pathways 


Chamnongpol et.al.(1998) Proc. 
Nat.Acad Sci USA 12;95:5818- 
23. 

Davis etal. (1993) 
Phytochemistry 32: 607-61 1 




Induction of cellular 
protectant genes such as 
glutathione S-transferase 
(GST) and ascorbate 
peroxidase 


Chen et.al. Plant J. (1996) 
10:955-966 

Gadea et.al.(1999) Mol Gen 
Genet 262:212-219 
Wuet.al.(1995) Plant Cell 7: 
1357-68 




ROS levels following 
wounding and changes in 
physical pressure 


Orozco-Cardenas and Ryan 

(1999) Proc.Nat. Acad. Sci. USA 

25;96:6553-7. 

Yahraus et al. (1995) Plant 

Physiol. 109: 1259-1266 




Salicyclic acid levels and 
signaling 


Durner and Klessig (1996) 
J.Biol.Chem. 271:28492-501 


Responses to Wounding 


Expression of genes Involved 
in wound repair and cell 
division 


Legendre et al. (1993) Plant 
Physiol. 102: 233-240 


Responses to Environmental 
Stress 


Expression of genes involved 
in responses to drought, cold, 
salt, heavy metals 


Shi et al. (2000) Proc. Natl. Acad. 
Sci. USA 97:6896-6901 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Reinforcement of Cell Walls 


Modulation of the Production 
of ExtracTable Proline-Rich 
Protein 


Bradley et al. (1992) Cell 70, 21- 
30 




Modulation of Lignification 


Mansouri et al. (1999) Physiol. 
Plant 106: 355-362 


Programmed Cell Death 


Induction of PCD activating 
genes 


Levine et al. (1996) Curr. Biol. 6: 
427-437. Reynolds et.al. (1998) 
Biochem.J. 330:115-20 




Suppression of PCD 
suppressing genes 


Pennell and Lamb (1997) Plant 
Cell 9, 1157-1168 



Other biological activities that can be modulated by the disease responsive genes and 
their products are listed in the Reference Table. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Disease responsive genes are characteristically differentially transcribed in response 
to fluctuating levels of disease. The MA_diff table(s)report(s) the changes in transcript levels 
of various disease responsive genes in the aerial parts of a plant 3 days after the plant was 
sprayed with a suspension of TMV relative to control plants sprayed with water. 

The data from this experiment reveal a number of types of disease responsive genes 
and gene products, including "early responders," and "delayed responders". Profiles of 
individual disease responsive genes are shown in the Table below with examples of which 
associated biological activities are modulated when the activities of one or more such genes 
vary in plants. 
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GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 


Upregulated 
transcripts 


Early 

Responders to 
Pathogens 


ROS Perception and 
Response 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodeling proteins 






Initiation of Gene 
Transcription 


Glutathione S -transferase 
(GST), 

heat shock proteins, 
salicylic acid (SA) response 
pathway proteins, jasmonate 
response pathway proteins, 
dehydrins, peroxidases, 
catalases 




Delayed 
Responders to 
Pathogens 


Initiation of Defence 
Gene Transcription 


Proteases, pathogen 
response (PR) proteins, 
cellulases, chitinases, 
cutinases, glucanases, other 
degrading enzymes, calcium 
channel blockers, 
phenylalanine ammonia 
lyase, proteins of defense 
pathways, cell wall proteins 
incuding proline rich 
proteins and glycine rich 
proteins, epoxide hydrolase, 
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methyl transferases 






Activation of cell death 
pathways 


Transcription factors 
kinases, phosphatases, DNA 
surveillance proteins, p53, 
proteases, endonucleases, 
GTP-binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, 
mitochondrial and 
chloroplast energy related 
proteins, ribosome 
inactivating proteins 






Initiation of Cellular 
Protectant Gene 
Transcription 


Reactive oxygen scavenging 
enzymes, GST, catalase, 
peroxidase, ascorbate 
oxidase 


Downregulated 
transcripts 


Early 

responders to 
pathogens 


Negative regulation of 
pathogen inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes repressed 
by TMV 


Negative regulation of 
ROS inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
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binding proteins, chromatin 
remodelling proteins 




Delayed 
Responders to 
Pathogens 


Negative regulation of 
pathogen inducible 
pathways released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes repressed 
by TMV 


Negative regulation of 
genes suppressing 
programmed cell death 
released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 



Use of Promoters of Disease Responsive Genes 

Promoters of Disease responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Disease responsive 
genes where the desired sequence is operably linked to a promoter of a Disease responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.M. DEFENSE (LOL2) RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Often growth and yield are limited by the ability of a plant to tolerate stress conditions, 
including pathogen attack. To combat such conditions, plant cells deploy a battery of inducible 
defense responses, including the triggering of an oxidative burst and the transcription of 
pathogenesis-related protein (PR protein) genes. Reactive oxygen species (ROS) such as H 2 0 2 
and NO from the oxidative burst play a signaling role, including initiation of the hypersensitive 
response (HR) and induction of systemic acquired resistance (SAR) to secondary infection by 
unrelated pathogens. Some PR proteins are able to degrade the cell walls of invading 
microorganisms, and phytoalexins are directly microbicidal. Other defense related pathways are 
regulated by salicylic acid (SA) or methyl jasmonate (MeJ). 

These responses depend on the recognition of a microbial avirulence gene product (avr) 
by a plant resistance gene product (R), and a series of downstream signaling events leading to 
transcription- independent and transcription-dependent disease resistance responses. Current 
models suggest that R- gene-encoded receptors specifically interact with pathogen-encoded 
ligands to trigger a signal transduction cascade. Several components include ndrl and edsl loci. 
NDR1, EDS1, PR1, as well as PDF1.2, a MeJ regulated gene and Niml, a SA regulated gene, 
are differentially regulated in plants with mutations in the LOL2 gene. 

LOL2 shares a novel zinc finger motif with LSD1, a negative regulator of cell death 
and defense response. Due to an alternative splice site the LOL2 gene encodes two different 
proteins, one of which contains an additional, putative DNA binding motif. Northern analysis 
demonstrated that LOL2 transcripts containing the additional DNA binding motif are 
predominantly upregulated after treatment with both virulent and avirulent Pseudomonas 
syringae pv maculicola strains. Modulation in this gene can also confer enhanced resistance 
to virulent and avirulent Peronospora parasitica isolates 

Examples of LOL2 responsive genes and gene products are shown in the Reference, 
Sequence, Protein Group, Protein Group Matrix, MA_diff and MA_clust tables. These genes 
and/or products are responsible for effects on traits such as plant vigor, disease resistance, 
and seed yield. The genes were discovered and characterized from a much larger set by 
microarray experiments designed to find genes whose mRNA products changed when the 
LOL2 gene was mutated in plants. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by hybridizing labeled fluorescent cDNA pools 
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to glass slides that contain spots of DNA (Schena et al. (1995) Science 270: 467-70). The US 
Arabidopsis Functional Genomics Consortium (AFGC) has recently made public the results 
from such microarray experiments conducted with AFGC chips containing some about 
10,000 non-redundant ESTs, selected from about 37,000 randomly sequenced ESTs 
5 generated from mRNA of different tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
with the LOL2 mutation versus wildtype were obtained. Specifically, the plant line lol-2-2 
tested, a loss of function mutation. The ESTs were compared to the Ceres full length cDNA 
and genomic sequence databanks, and equivalent Ceres clones identified. The MA_diff table 
10 reports the results of this analysis, indicating those Ceres clones which are up or down 

regulated over controls, thereby indicating the Ceres clones which represent LOL2 responsive 
genes. 

Manipulation of one or more LOL2 responsive gene activities are useful to modulate 
^ the biological processes and/or phenotypes listed below. LOL2 responsive genes and gene 
£5 products can act alone or in combination. Useful combinations include LOL2 responsive 
J genes and/or gene products with similar transcription profiles, similar biological activities, or 
fy members of the same or functionally related biochemical pathways. Whole pathways or 
ffi segments of pathways are controlled by transcription factor proteins and proteins controlling 
f** the activity of signal transduction pathways. Therefore, manipulation of such protein levels is 
StO especially useful for altering phenotypes and biochemical activities of plants. 
£ Such LOL2 responsive genes and gene products can function to either increase or 

h: dampen the above phenotypes or activities either in response to changes in active LOL2 gene 

or in the absence. The MA_diff Table(s) reports the transcript levels of the experiment (see 

EXPT ID: lol2 (relating to SMD 8031, SMD 8032)). For transcripts that had higher levels in 
25 the samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 

reduced in root tips as compared to the control. For more experimental detail see the 

Example section below. 

Defense genes are those sequences that showed differential expression as compared to 

controls, namely those sequences identified in the MA_diff tables with a "+" or indication 

30 
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Defense Genes Identified By Cluster Analyses Of Differential Expression 

Defense Genes Identified By Correlation To Genes That Are Differentially 



Expressed 



As described above, the transcription profiles of genes that act together are 



well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 



comprises a cDNA ID that also appears in Expt ID lol2 (relating to SMD 8031, SMD 8032) 
oftheMA_difftable(s). 

Defense Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Defense genes. A group in the MA_clust 
is considered a Defense pathway or network if the group comprises a cDNA ID that also 
appears in Knock-in or Knock-out tables that causes one or more of the phenotypes described 
in section above. 

Defense Genes Identified By Amino Acid Sequence Similarity 



share amino acid similarity to the sequences encoded by com and Arabidopsis Defense genes. 
Groups of Defense genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Defense 
pathway or network is a group of proteins that also exhibits Defense functions/utilities. 

Further, promoters of LOL2 responsive genes, as described in the Reference tables, 
for example, are useful to modulate transcription that is induced by LOL2 responsive genes 
or any of the following phenotypes or biological activities below. Further, any desired 
sequence can be transcribed in similar temporal, tissue, or enviromentally specific patterns as 
the LOL2 responsive genes when the desired sequence is operably linked to a promoter of a 
LOL2 responsive gene. 



A pathway or network of Defense genes is any group in the MA_clust that 



Defense genes from other plant species typically encode polypeptides that 
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V.M. 1 . USE OF LOL2 RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE PHENOTYPES 

Lol2 Responsive Genes And Gene Products Are Useful To Or Modulate One Or 
More Of The Following Phenotypes: 

• Pathogen Tolerance and/or Resistance 

• Avr/R locus interactions 

• Non-Host interactions 



SAR, e.g., disease responsive genes acting in conjunction with infection with any 
of the organisms listed below 

• Resistance to bacteria e.g. to Erwinia stewartii, Pseudomonas syringae, 
Pseudomonas tabaci, Stuart's wilt, etc. 

• Resistance to fungi e.g. to downy mildews such as Scleropthora macrospora, 
Sclerophthora rayissiae, Sclerospora graminicola, Peronosclerospora sorghi, 
Peronosclerospora philippinensis, Peronosclerospora sacchari, Peronosclerospora 
maydis; rusts such as Puccinia sorphi, Puccinia polysora, Physopella zeae, etc.; 
and to other fungal diseases e.g. Cercospora zeae-maydis, Colletotrichum 
graminicola, Fusarium monoliforme, Exserohilum turcicum, Bipolaris maydis, 
Phytophthora parasitica, Peronospora tabacina, Septoria, etc.; 

• Resistance to viruses or viroids e.g. to tobacco or cucumber mosaic virus, ringspot 
virus, necrosis virus, pelargonium leaf curl virus, red clover mottle virus, tomato 
bushy stunt virus, and like viruses; 

• Resistance to insecs, such as to aphids e.g. Myzus persicae; to beetles and beetle 
larvae; to lepidoptera larvae e.g. Heliothus etc. 

• Resistance to Nematodes, e.g. Meloidogyne incognita etc 

• Local resistance in primary (infected) or secondary (uninfected) leaves 
Stress Tolerance 

• Winter Survival 
Cold Tolerance 
Salt tolerance 

• Heavy Metal Tolerance, such as Cadmium 

• Tolerance to Physical Wounding; 



HR 
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. Increased Organelle Tolerance to Redox Stress, such as in Mitochondria, and 

chloroplasts 
. Cell Death 

• Programmed cell death, including death of diseased tissue and during senescence 

• Fruit Drop 
Biomass 

. Fresh and Dry Weight during any time in plant life, such as maturation 

• Number of Flowers, Seeds, Branches, and/or Leaves 

. Seed Yield, including Number, Size, Weight, and/or Harvest Index 

. Fruit Yield, including Number, Size, Weight, and/or Harvest Index 

• Plant Development 

• Time to Fruit Maturity 

. Cell Wall Strengthening and Reinforcement 

• Plant Product Quality 

• Paper making quality 

• Food additives 

• Treatment of Indications modulated by Free Radicals 
Cancer 

• Kinds of low molecular weight compounds such as phytoalexins 

• Abundance of low molecular weight compounds such as phytoalexins 
Other phenotypes based on gene silencing 

To regulate any of the phenotype(s) above, activities of one or more of the LOL2 
responsive genes or gene products can be modulated and the plants can be tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered in a 
plant utilizing the procedures described herein and the phenotypes can be screened for 
variants as in Winkler et al. (1998) Plant Physiol 118: 743-50 and assayed , for example, in 
accordance to Alvarez et al., (1998) Cell 92: 773-784; Halhbrock and Scheel, (1989) Ann. 
Rev. Plant Physiol. Plant Mol. Biol. 40: 347-369; Lamb et al., (1997) Ann. Rev. Plant Mol. 
Biol. Plant Physiol. 48: 251-275; Lapwood et al. (1984) Plant Pathol. 33: 13-20; Levine et al. 
(1996) Curr. Biol. 6: 427-437; McKersie et al., (2000) Plant Physiol. 122: 1427-1437; Olson 
and Varner (1993) Plant J. 4: 887-892; Pastore et al., (2000), FEBS Lett 470: 88-92; Pastori 
et al., (1997) Plant Physiol. 113: 411-418; Romero-Puertas et al., (1999) Free Radic. Res. 
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1999 31 Suppl: S25-31; Shirataki et al., Anticancer Res 20: 423-426 (2000); Wu et al., (1995) 
Plant Cell 7: 1357-1368. 

V.M.2.USE OF DEFENSE RESPONSIVE GENES TO MODULATE 
BIOCHEMICAL ACTIVITIES 

The activities of one or more of the defense (LOL2) responsive genes can be 
modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities are documented and can be measured 
according to the citations above and included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Resistance To Pathogens 


Induction Of ROS Signaling 
Pathways 


Wu et.al.(1995) Plant Cell 7: 
1357-68 




Modulation Of Nitric Oxide 
Signaling 


Delledonne et al. (1998) Nature 
394: 585-588 




Induction Of PR Proteins, 
Phytoalexins, And Defense 
Pathways 


Chamnongpol et.al.(1998) Proc. 
Nat.Acad Sci USA 12;95:5818-23. 
Davis et al. (1993) Phytochemistry 
32: 607-611 




Induction Of Cellular 
Protectant Genes Such As 
Glutathione S-Transferase 
(GST) And Ascorbate 
Peroxidase 


Chen et.al. Plant J. (1996) 10:955- 
966 

Gadea et.al.(1999) Mol Gen Genet 
262:212-219 

Wu et.al.(l 995) Plant Cell 7: 
1357-68 




ROS Levels Following 
Wounding And Changes In 
Physical Pressure 


Orozco-Cardenas and Ryan (1999) 

Proc.Nat. Acad. Sci. USA 

25;96:6553-7. 

Yahraus et al. (1995) Plant 

Physiol. 109: 1259-1266 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Salicyclic Acid Levels And 
Signaling 


Durner and Klessig (1996) 
J.Biol.Chem. 271:28492-501 


Responses To Wounding 


Expression Of Genes Involved 
In Wound Repair And Cell 
Division 


Legendre et al. (1993) Plant 
Physiol. 102: 233-240 


Responses To 
Environmental Stress 


Expression Of Genes Involved 
In Responses To Drought, 
Cold, Salt, Heavy Metals 


Shi et al. (2000) Proc. Natl. Acad. 
Sci. USA 97:6896-6901 


Reinforcement Of Cell 
Walls 


Modulation Of The Production 
Of ExtracTable Proline-Rich 
Protein 


Bradley et al. (1992) Cell 70, 21- 
30 




Modulation Of Lignification 


Mansouri et al. (1999) Physiol. 
Plant 106: 355-362 


Programmed Cell Death 


Induction Of Pcd Activating 
Genes 


Levine et al. (1996) Curr. Biol. 6: 
427-437. Reynolds et.al. (1998) 
Biochem.J. 330:115-20 




Suppression Of PCD 
Suppressing Genes 


Pennell and Lamb (1997) Plant 
Cell 9, 1157-1168 



Other biological activities that can be modulated by the LOL2 responsive genes and 
their products are listed in the Reference tables. Assays for detecting such biological 
activities are described in the Protein Domain table. 

LOL2 responsive genes are characteristically differentially transcribed in response to 
fluctuating levels of disease. MA_diff table reports the changes in transcript levels of various 
LOL2 responsive genes in the lol-2 line versus control plants. 

The data from this experiment reveal a number of types of LOL2 responsive genes 
and gene products. Profiles of individual LOL2 responsive genes are shown in the Table 
below with examples of which associated biological activities are modulated when the 
activities of one or more such genes vary in plants. 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSequence 


BIOCHEMICAL 


LEVELS 


OF GENE 




ACTIVITY 








OF GENE PRODUCTS 


Upregulated 


Early 


ROS Perception and 


Transcription factors, 


transcripts 


Responders to 


Response 


kinases, phosphatases, GTP- 




the LOL2 




binding proteins (G- 




Mutation 




proteins), leucine rich repeat 








proteins (LRRs), 








transporters, calcium 








binding proteins, chromatin 








remodeling proteins 






Initiation of Gene 


Glutathione S-transferase 






Transcription 


(GST), 








heat shock proteins, 








salicylic acid (SA) response 








pathway proteins, jasmonate 








response pathway proteins, 








dehydrins, peroxidases, 








catalases 




Delayed 


Initiation of Defence 


Proteases, pathogen 




Responders to 


Gene Transcription 


response (PR) proteins, 




the LOL2 




cellulases, chitiriases, 




Mutation 




cutinases, glucanases, other 








degrading enzymes, calcium 








channel blockers, 








phenylalanine ammonia 








lyase, proteins of defense 








pathways, cell wall proteins 








incuding proline rich 








proteins and glycine rich 








proteins, epoxide hydrolase, 








methyl transferases 
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GENE 


FUNCTIONAL 


PHYSIOLOGICAL 


EXAMPLES OF 


EXPRESSION 


CATEGORY 


CONSequence 


BIOCHEMICAL 


LEVELS 


OF GENE 




ACTIVITY 








OF GENE PRODUCTS 






Activation of cell death 


Transcription factors 






pathways 


kinases, phosphatases, DNA 








surveillance proteins, p53, 








proteases, endonucleases, 








GTP-binding proteins (G- 








proteins), leucine rich repeat 








proteins (LRRs), 








transporters, calcium 








binding proteins, 








mitochondrial and 








chloroplast energy related 








proteins, ribosome 








inactivating proteins 






Initiation of Cellular 


Reactive oxygen scavenging 






Protectant Gene 


enzymes, GST, catalase, 






Transcription 


peroxidase, ascorbate 








oxidase 


Downregulated 


Early 


Negative regulation of 


Transcription factors, 


transcripts 


Responders to 


LOL2 Mutation 


kinases, phosphatases, GTP- 




the LOL2 


inducible pathways 


binding proteins (G- 




Mutation 


released 


proteins), leucine rich repeat 








proteins (LRRs), 








transporters, calcium 








binding proteins, chromatin 








remodelling proteins 




Genes 


Negative regulation of 


Transcription factors, 




Repressed by 


ROS inducible 


kinases, phosphatases, GTP- 




the LOL2 


pathways released 


binding proteins (G- 



Page 530 of 772 




Attorney DKt. Z/3U-148JF 
Client Dkt. 80090.005 



GENE 

EXPRESSION 
LEVELS 


FUNCTIONAL 
CATEGORY 
OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 

OF GENE PRODUCTS 




Mutation 




proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Delayed 
Responders to 
the LOL2 
Mutation 


Negative regulation of 
LOL2 Mutation 
inducible pathways 
released 


Transcription factors, 
kinases, phosphatases, GTP- 
binding proteins (G- 
proteins), leucine rich repeat 
proteins (LRRs), 
transporters, calcium 
binding proteins, chromatin 
remodelling proteins 




Genes 

Repressed By 
The LOL2 
Mutation 


Negative Regulation Of 
Genes Suppressing 
Programmed Cell 
Death Released 


Transcription Factors, 
Kinases, Phosphatases, 
GTP-Binding Proteins (G- 
Proteins), Leucine Rich 
Repeat Proteins (Lrrs), 
Transporters, Calcium 
Binding Proteins, 
Chromatin Remodelling 
Proteins 



Use of Promoters of Defense Responsive Genes 

Promoters of Defense responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Defense responsive 
genes where the desired sequence is operably linked to a promoter of a Defense responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
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in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 



Page 532 of 772 




Attorney Ukt. ^7^U-1483F 
Client Dkt. 80090.005 



V.N. IRON RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Iron (Fe) deficiency in humans is the most prevalent nutritional problem worldwide 
today. Increasing iron availability via diet is a sustainable malnutrition solution for many of 
the world's nations. One-third of the world's soils, however, are iron deficient. 
Consequently, to form a food-based solution to iron malnutrition, we need a better 
understanding of iron uptake, storage and utilization by plants. Furthermore, exposure to non- 
toxic Fe levels appears to affect inherent plant defense mechanisms. Consequently, exploring 
the effects of Fe exposure has potential for advances in plant disease resistance in addition to 
human nutrition. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two 
differentially labeled fluorescent FeNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing 10,000 non-redundant ESTs, selected from 37,000 
randomly sequenced ESTs generated from mRNA of different tissues and developmental 
stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full length FeNA and genomic sequence 
databanks, and identical Ceres clones identified. MA_diff table reports the results of this 
analysis, indicating those Ceres clones that are up or down regulated over controls, thereby 
indicating the Ceres clones which are iron responsive genes. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
Iron (relating to SMD 7114, SMD 7115, SMD 7125)). For transcripts that had higher levels 
in the samples than the control, a "+" is shown. A "-" is shown for when transcript levels 
were reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Iron genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a or "-" indication 



Iron Genes Identified By Cluster Analyses Of Differential Expression 



Iron Genes Identified By Correlation To Genes That Are Differentially 



Expressed 
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As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
5 transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Iron genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Iron (relating to SMD 7114, SMD 7115, 
SMD 7125) of the MA_diff table(s). 

10 Iron Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Iron genes. A group in the MA_clust is 
S considered a Iron pathway or network if the group comprises a cDNA ID that also appears in 
iJ5 Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 



w 

ui 

fV\ Iron Genes Identified By Amino Acid Sequence Similarity 

%. Iron genes from other plant species typically encode polypeptides that share 

ISO amino acid similarity to the sequences encoded by corn and Arabidopsis Iron genes. Groups 
g of Iron genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Iron pathway or 
network is a group of proteins that also exhibits Iron functions/utilities. 

25 

V.N.I. USE OF IRON RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Iron responsive genes and gene products are useful to or modulate one or more 
of the following phenotypes: 
30 . Growth 

Roots 

• Root hair formation 
Stems 
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• Leaves 
Development 

• Senescence 

• Plant nutrition 

• Uptake and assimilation of organic compounds 

• Uptake and assimilation of inorganic compounds 

• Animal (including human) nutrition 

• Improved dietary mineral nutrition 

• Stress Response 

• Metabolic detoxification 

• Heavy metals 

To improve any of the phenotype(s) above, activities of one or more of the 
iron responsive genes or gene products can be modulated and tested by screening for 
the desired trait. Specifically, the gene, mRNA levels, or protein levels can be altered 
in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and 
Pelletier(1998, Methods. Mol. Biol. 82:259-266) and visually inspected for the 
desired phenotype or metabolically and/or functionally assayed according to Schmidt 
et al. (2000, Plant Physiol 122:1109-18), Meagher (2000) Current Opinion in Plant 
Biology 3: 153-62), Deak (1999, Nature Biotechnology (1999, Nature Biotechnology 
17: 192-96), Wei and Theil (2000, J. Biol Chem 275: 17488-93) and Vansuyt et al. 
(1997, FEBS Letters 410: 195-200). 

V.N.2. USE OF IRON-RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the iron responsive genes can be modulated to 
change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included 
in the Table below: 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC ACTIVITIES 
AND/OR PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Growth , Differentiation 
and Development 


• Root Growth 

- Initiation of root 
hairs 


Robinson et al. (1999) 
Nature 397: 694-97 


Metabolisms 


• Iron sensing 

• Iron uptake and transport 

-decreased iron 
transport 

-phytoremediation 


Thomine et al. (2000) 
PNAS USA 97:4991-6 
Thomine et al. (2000) 
PNAS USA 97:4991-6 
Zhu(1999) Plant 
Physiol 11 9: 73-79 


Plant Defenses 


Protection from oxidative 
damage 


Deak(1999) Nature 
Biotechnology 17: 192- 
6 


Signaling 


Specific gene 
transcription gene 
silencing 


Brand and Perrimon 
(1993) Development 
118:401-415 



Other biological activities that can be modulated by the iron responsive genes and 
gene products are listed in the REFERENCE Table. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Iron responsive genes are characteristically differentially transcribed in response to 
fluctuating iron levels or concentrations, whether internal or external to an organism or cell. 
MA_diff table reports the changes in transcript levels of various iron responsive genes. 

The microarray comparison consists of probes prepared from root RNA of A. thaliana 
(Columbia) seedlings grown under iron-sufficient conditions and seedlings grown under iron- 
deficient. The data from this experiment reveal a number of types genes and gene products. 
Profiles of these different iron responsive genes are shown in the Table below with examples 
of associated biological activities. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


responders to iron 


• Iron perception 


Transporters 


transcripts 


application 


• Iron uptake and 


• Metabolic enzymes 






transport 


Change in cell 






Iron metabolism 


membrane structure 






Synthesis of 


and potential 






secondary 


• Kinases and 






metabolites 


phosphatases 






and/or proteins 


Transcription 








activators 






Modulation of 


Change in chromatin 






iron response 


structure and/or 






transduction 


localized DNA 






pathways 


topology 






Specific gene 








transcription 








initiation 




Down-regulated 


responder to iron 


• Negative 


• Transcription factors 


transcripts 


repressors of iron state 


regulation of iron 


Change in protein 




of metabolism 


pathways 


structure by 








phosphorylation 




Genes with 


Changes in 


(kinases) or 




discontinued 


pathways and 


dephosphoryaltion 




expression or 


processes 


(phosphatases) 




unsTable mRNA in 


operating in cells 


• Change in chromatin 




presence of iron 


Changes in other 


structure and/or 






metabolisms than 


DNA topology 






iron 


• Stability of factors 








for protein synthesis 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 








and degradation 
Metabolic enzymes 



Use of Promoters of Iron Responsive Genes 

Promoters of Iron responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Iron responsive genes 
where the desired sequence is operably linked to a promoter of a Iron responsive gene. The 
protein product of such a polynucleotide is usually synthesized in the same cells, in response 
to the same stimuli as the protein product of the gene from which the promoter was derived. 
Such promoter are also useful to produce antisense mRNAs to down-regulate the product of 
proteins, or to produce sense mRNAs to down-regulate mRNAs via sense suppression. 
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V.O, SHADE RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 

Plants sense the ratio of Red (R) : Far Red (FR) light in their environment and 
respond differently to particular ratios. A low R:FR ratio, for example, enhances cell 
elongation and favors flowering over leaf production. The changes in R:FR ratios mimic and 
cause the shading response effects in plants. The response of a plant to shade in the canopy 
structures of agricultural crop fields influences crop yields significantly. Therefore 
manipulation of genes regulating the shade avoidance responses can improve crop yields. 
While phytochromes mediate the shade avoidance response, the down-stream factors 
participating in this pathway are largely unknown. One potential downstream participant, 
ATHB-2, is a member of the HD-Zip class of transcription factors and shows a strong and 
rapid response to changes in the R:FR ratio. ATHB-2 overexpressors have a thinner root 
mass, smaller and fewer leaves and longer hypocotyls and petioles. This elongation arises 
from longer epidermal and cortical cells, and a decrease in secondary vascular tissues, 
paralleling the changes observed in wild-type seedlings grown under conditions simulating 
canopy shade. On the other hand, plants with reduced ATHB-2 expression have a thick root 
mass and many larger leaves and shorter hypocotyls and petioles. Here, the changes in the 
hypocotyl result from shorter epidermal and cortical cells and increased proliferation of 
vascular tissue. Interestingly, application of auxin is able to reverse the root phenotypic 
consequences of high ATHB-2 levels, restoring the wild-type phenotype. Consequently, 
given that ATHB-2 is tightly regulated by phytochrome, these data suggest that ATHB-2 may 
link the auxin and phytochrome pathways in. the shade avoidance response pathway. 

Changes in R:FR ratios promote changes in gene expression. Microarray technology 
allows monitoring of gene expression levels for thousands of genes in a single experiment. 
This is achieved by hybridizing labeled fluorescent cDNA pools to glass slides that contain 
spots of DNA (Schena et al. (1995) Science 270: 467-70). The US Arabidopsis Functional 
Genomics Consortium (AFGC) has recently made public the results from such microarray 
experiments conducted with AFGC chips containing about 10,000 non-redundant ESTs, 
selected from about37,000 randomly sequenced ESTs generated from mRNA of different 
tissues and developmental stages. 

The sequences of the ESTs showing at least two-fold increases or decreases in plants 
given 4 hours of FR rich light after growth in high R:FR light compared with the controls of 
plants grown in high R:FR light only, were identified, compared to the Ceres full length 
cDNA and genomic sequence databanks, and equivalent Ceres clones identified. The 
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MA_diff table(s) report(s) the results of this analysis, indicating those Ceres clones which are 
up or down regulated over controls, thereby indicating the Ceres clones which are shade 
avoidance responsive genes. 

Examples of far red light induced, shade avoidance responsive genes and gene 
products are shown in the Reference and Sequence Tables. These genes and/or products are 
responsible for effects on traits such as plant vigor and seed yield. 

While far red light, shade avoidance responsive polynucleotides and gene products 
can act alone, combinations of these polynucleotides also affect growth and development. 
Useful combinations include different shade avoidance responsive polynucleotides and/or 
gene products that have similar transcription profiles or similar biological activities, and 
members of the same or similar biochemical pathways. In addition, the combination of a 
shade avoidance responsive polynucleotide and/or gene product with another 
environmentally responsive polynucleotides is also useful because of the interactions that 
exist between hormone regulated pathways, stress and pathogen induced pathways, 
nutritional pathways, light induced pathways and development. Here, in addition to 
polynucleotides having similar transcription profiles and/or biological activities, useful 
combinations include polynucleotides that may have different transcription profiles but which 
participate in common or overlapping pathways. 

Such far red light induced shade avoidance responsive genes and gene 
products can function to either increase or dampen the above phenotypes or activities either 
in response to changes in far red light or in the absence of far red light fluctuations. The 
MA diff Table(s) reports the transcript levels of the experiment (see EXPT ED: Shade 
(relating to SMD 8130, SMD 7230)). For transcripts that had higher levels in the samples 
than the control, a "+" is shown. A "-" is shown for when transcript levels were reduced in 
root tips as compared to the control. For more experimental detail see the Example section 
below. 

Shade genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Shade Genes Identified By Cluster Analyses Of Differential Expression 

Shade Genes Identified By Correlation To Genes That Are Differentially 

Expressed 
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As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Shade genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Shade (relating to SMD 8130, SMD 7230) 
of the MA_difftable(s). 

Shade Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Shade genes. A group in the MA_clust is 
considered a Shade pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

Shade Genes Identified By Amino Acid Sequence Similarity 

Shade genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Shade genes. 
Groups of Shade genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ID that corresponds to a cDNA ID member of a Shade 
pathway or network is a group of proteins that also exhibits Shade functions/utilities. 

Further, promoters of shade avoidance responsive genes, as described in the 
Reference tables, for example, are useful to modulate transcription that is induced by shade 
avoidance or any of the following phenotypes or biological activities below. Further, any 
desired sequence can be transcribed in similar temporal, tissue, or environmentally specific 
patterns as the shade avoidance responsive genes when the desired sequence is operably 
linked to a promoter of a circadian (clock) responsive gene. 
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V.O.I. USE OF FAR RED RESPONSIVE, SHADE AVOIDANCE 

RESPONSE GENES TO MODULATE PHENOTYPES 
High FR:R, shade avoidance responsive genes and gene products can be used to 
alter or modulate one or more of the following phenotypes: 

• Growth 
Roots 

• Elongation 

• Lateral root formation 
Stems 

• Elongation 
Expansion 

• Leaves 

• Expansion 
Carotenoid composition 

• Development 
. Cell 

• Growth 

• Elongation 

• Photosynthetic apparatus 
Efficiency 

• Flower 

• Flowering time 

• Fruit 
. Seed 

• Dormancy 

• Control rate and timing of germination 

• Prolongs seed storage and viability 

• Inhibition of hydrolytic enzyme synthesis 
Seed and Fruit yield 

• Senescence 

• Abscission 
. Leaf fall 

• Flower longevity 
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. Differentiation 
. Vascularization 
. Shade (Avoidance) Responses in plant architecture 

To regulate any of the phenotype(s) above, activities of one or more of the High FR: R 
light, shade avoidance responsive genes or gene products can be modulated and the plants tested 
by screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be assayed. 
As an example, a plant can be transformed according to Bechtold and Pelletier (1998, Methods. 
Mol. Biol. 82:259-266) and/or screened for variants as in Winkler et al. (1998) Plant Physiol 
118: 743-50 and visually inspected for the desired phenotype or metabolically and/or 
functionally assayed according to Carabelli et al. (1996, PNAS USA 93: 3530-3535), 
Aguirrezabal and Tardieu (1996, J Exp Bot 47: 41 1-20), Heyer et al. (1995, Plant Physiol 109: 
53-61), Garcia-Plazaola et al. (1997, J Exp Bot 48: 1667-74), Schwanz et al. (1996, J Exp Bot 
47L 1941-50), Koehne et al. (1999, Biochem Biophys Acta 1412:94-107), Melis (1984, J Cell 
Biochem 24: 271-85), Steindeler et al. (1999, Development 126: 4235-45), Cruz (1997, J Exp 
Bot 48: 15-24), Stephanou and Manetas (1997, J Exp Bot 48: 1977-85), Grammatikopoulos et al 
(1999, J Exp Bot 50:517-21), Krause et al. (1999, Plant Physiol 121 : 1349-58), Aukerman et al. 
(1997, Plant Cell 9: 1317-26), Wagner et al. (1997, Plant Cell 9: 731-43), Weinig (2000) 
Evolution Int J Org Evolution 54: 124-26), Cocburn et al. (1996, J Exp Bot 47: 647-53), Devlin 
et al. (1999, Plant Physiol 119: 909-15), Devlin et al. (1998, Plant Cell 10: 1479-87), Finlayson 
et al. (1998, Plant Physiol 116: 17-25), Morelli and Ruberti (2000, Plant Physiol 122: 621-26), 
Aphalo et al. (1999, J Exp Bot 50: 1629-34), Sims et al. (1999, J Exp Bot 50: 50: 645-53) and 
Ballare (1999, Trends Plant Sci 4: 97-102). 
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V.O.2. USE OF FAR RED LIGHT, SHADE AVOIDANCE RESPONSIVE 
GENES TO MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the far red light, shade avoidance responsive 
genes can be modulated to change biochemical or metabolic activities and/or 
pathways such as those noted below. Such biological activities can be measured 
according to the citations included in the Table below: 



/ 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Cell Growth and 
Differentiation 


Cell Elongation 


Carabelli et al. (1996) PNAS USA 
93: 3530-35 




Leaf Expansion 


Heyer et al. (1995) Plant Physiol 
109: 53-61 


Photosynthesis 


Development of 
Photosynthetic Apparatus 


Jagtap et al. (1998) J Exp Bot 49: 
1715-21 

Melis (1984) J Cell Biochem 24: 
271-285 

McCain (1995) Biophys J 69: 1 105- 
10 




Carotenoid Composition 


Garcia-Plazaola et al (1997) J Exp 
Bot 48: 1667-74 


Carbon/Nitrogen 
Metabolism 


Carbon and Nitrogen 
Assimilation 


Cruz (1997) J Exp Bot 48: 15-24 


Far red light, shade 
avoidance response 
binding by transcription 
factors 




Newton AL, Sharpe BK, Kwan A, 
Mackay JP, Crossley M. J Biol 
Chem. 2000Mayl9;275(20):15128- 
34; Lopez Ribera I, Ruiz-Avila L, 
Puigdomenech P. Biochem Biophys 
Res Commun. 1997 Jul 
18;236(2):510-6; de Pater S, Greco 
V, Pham K, Memelink J, Kijne J. 
Nucleic Acids Res. 1996 Dec 1; 
24(23):4624-31. . 


Signaling 


UV Light Perception 


Stephanou and Manetas (1997) J 
Exp Bot 48: 1977-85 
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PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 




Far-red/Red Light 
Perception 


Aukerman et al. (1997) Plant Cell 
9: 1317-26 

Wagner et al. (1997) Plant Cell 9: 
731-43 




Interaction of "Shade 
Factor" with Ethylene 
Production/Transduction 


Finlayson et al. (1998) Plant 
Physiol 116: 17-25 




Interaction of "Shade 
Factor" with Auxin 
Production/Transduction 


Reed et al. (1998) Plant Physiol 
118: 1369-78 




Plant to Plant signalling 


Sims et al. (1999) J Exp Bot 50: 
645-53 



Other biological activities that can be modulated by shade avoidance response genes and 
their products are listed in the REF TABLES. Assays for detecting such biological activities are 
described in the Protein Domain table. 

High FR:R, shade avoidance responsive genes are differentially transcribed in 
response to high FR:R ratios. The microarray comparison to reveal such genes consisted of 
probes prepared from RNA isolated from the aerial tissues of A. thaliana (Columbia) two- 
week old seedlings grown in high R:FR ratios compared to seedlings grown in high R:FR 
ratios followed by 4 hours of FR-rich light treatment. The data from this experiment reveal a 
number of types genes and gene products and examples of the classes of genes are given in 
the Table below. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders to high 


• Far red light 


• Transporters 


transcripts 


FR:R light ratios 


perception 


Metabolic enzymes 






• Metabolism 


Change in cell 






affected by far red 


membrane structure 




Genes induced by 


light 


and potential 




high FR:R light ratio 


• Synthesis of 


• Kinases and 






secondary 


phosphatases 






metabolites and/or 


Transcription 






proteins 


activators 








• Change in chromatin 






• Modulation of 


structure and/or 






high FR:R light 


localized DNA 






ratio transduction 


topology 






pathways 


Leaf production 






Specific gene 


factors 






transcription 








initiation 




Down-regulated 


Responders to high 


Changes in 


• Transcription factors 


transcripts 


FR:R light ratios 


pathways and 


Change in protein 






processes 


structure by 




Genes repressed by 


operating in cells 


phosphorylation 




high FR:R light ratio 


Changes in 


(kinases) or 






metabolisms other 


dephosphorylation 




Genes with 


than far red 


(phosphatases) 




discontinued 


stimulated 


• Change in chromatin 




expression or 


pathways 


structure and/or DNA 




unsTable mRNA 




topology 




during high FR:R 




Stability of factors for 
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TRANSCRIPT 



TYPE OF GENES 



PHYSIOLOGICAL 



EXAMPLES OF 



LEVELS 



CONSEQUENCES 



BIOCHEMICAL 



ACTIVITY 



ratio light 



protein synthesis and 
degradation 
Metabolic enzymes 
Cell elongation factors 
Flowering promotion 



factors 



Use of Promoters of Shade Avoidance Genes 

Promoters of Shade Avoidance genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Shade Avoidance 
genes where the desired sequence is operably linked to a promoter of a Shade Avoidance 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.P. SULFUR RESPONSIVE GENES, GENE COMPONENTS AND 
PRODUCTS 

Sulfur is one of the important macronutrients required by plants. It is taken up from 
the soil solution by roots as in the form of sulfate anion which higher plants are dependent on 
to fulfill their nutritional sulfur requirement. After uptake from the soil, sulfate is either 
accumulated and stored in vacuole or it is assimilated into various organic compounds, e.g. 
cysteine, glutathione, methionine, etc. Thus, plants also serve as nutritional sulfur sources for 
animals. Sulfur can be assimilated in one of two ways: it is either incorporated as sulfate in a 
reaction called sulfation, or it is first reduced to sulfide, the substrate for cysteine synthesis. 
In plants, majority of sulfur is assimilated in reduced form. 

Sulfur comprises a small by vital fraction of the atoms in many protein molecules. As 
disulfide bridges, the sulfur atoms aid in stabilizing the folded proteins, such cysteine 
residues. Cys is the first sulfur-containing amino acids, which in proteins form disulfide 
bonds that may affect the tertiary structures and enzyme activities. This redox balance is 
mediated by the disulfide/thiol interchange of thioredoxin or glutaredoxin using NADPH as 
an electron donor. Sulfur can also become sulfhydryl (SH) groups participating in the active 
sites of some enzymes and some enzymes require the aid of small molecules that contain 
sulfur. In addition, the machinery of photosynthesis includes some sulfur-containing 
compounds, such as ferrodoxin.. Thus, sulfate assimilation plays important roles not only in 
the sulfur nutrition but also in the ubiquitous process that may regulate the biochemical 
reactions of various metabolic pathways. 

Deficiency of sulfur leads to a marked chlorosis in younger leaves, which may 
become white in color. Other symptoms of sulfur deficiency also include weak stems and 
reduced growth. Adding sulfur fertilizer to plants can increase root development and a 
deeper green color of the leaves in sulfur-deficient plants. However, Sulfur is generally 
sufficient in soils for two reasons: it is a contaminant in potassium and other fertilizers and a 
product of industrial combustion. Sulfur limitation in plants is thus likely due to the 
limitation of the uptake and distribution of sulfate in plants. Seven cell type specific sulfate 
transporter genes have been isolated from Arabidopsis. In sulfate-starved plants, expression 
of the high-affinity transporter, AtSTl-1, is induced in root epidermis and cortex for 
acquisition of sulfur. The low affinity transporter, AtST2-l (AST68), accumulates in the root 
vascular tissue by sulfate starvation for root-to-shoot transport of sulfate. These studies have 
shown that the whole-plant process of sulfate transport is coordinately regulated by the 
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expression of these 2 sulfate transporter genes under sulfur limited conditions. Recent studies 
have proposed that feeding of O-acetylserine, GSH and selenate may regulate the expression 
of AtSTl-1 and AtST2-l (AST68) in roots either positively or negatively. However, 
regulatory proteins that may directly control the expression of these genes have not been 
identified yet. 

It has been established that there are regulatory interactions between assimilatory 
sulfate and nitrate reduction in plants. The two assimilatory pathways are very similar and 
well coordinated; deficiency for one element was shown to repress the other pathway. The 
coordination between them should be taken into consideration when one tries to alter one of 
pathways. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two 
differentially labeled fluorescent cDNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing 10,000 non-redundant ESTs, selected from 37,000 
randomly sequenced ESTs generated from mRNA of different tissues and developmental 
stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and identical Ceres clones identified. MA_diff table reports the results of this 
analysis, indicating those Ceres clones which are up or down regulated over controls, thereby 
indicating the Ceres clones which are sulfur response responsive genes. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
Sulfur (relating to SMD 8034, SMD 8035)). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 

Sulfur genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 



Sulfur Genes Identified By Cluster Analyses Of Differential Expression 
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Sulfur Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Sulfur genes is any group in the MA_clust that 
comprises a cDNA ED that also appears in Expt ID Sulfur (relating to SMD 8034, SMD 
8035) of the MA_diff table(s). 

Sulfur Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Sulfur genes. A group in the MA_clust is 
considered a Sulfur pathway or network if the group comprises a cDNA ID that also appears 
in Knock-in or Knock-out tables that causes one or more of the phenotypes described in 
section above. 

Sulfur Genes Identified By Amino Acid Sequence Similarity 

Sulfur genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Sulfur genes. 
Groups of Sulfur genes are identified in the Protein Group table. In this table, any protein 
group that comprises a peptide ED that corresponds to a cDNA ID member of a Sulfur 
pathway or network is a group of proteins that also exhibits Sulfur functions/utilities. 

V.P.I. USE OF SULFUR RESPONSIVE GENES TO MODULATE 
PHENOTYPES 

Sulfur responsive genes and gene products are useful to or modulate one or 
more of the following phenotypes: 

• Growth 

• Roots 
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• Stems 

• Leaves 

• Development 

• Chloroplasts and Mitochondria 

• Fruit Development 
Seed Development 

• Seed storage proteins 

• Senescence 

• Differentiation 

• Plastid/chloroplast and mitochondria differentiation 

• Protection against oxidative damage 

• regulation of enzymes via redox control by thiol groups 

• Metabolic detoxification 

• Photosynthesis 

• Carbon dioxide fixation 

To improve any of the phenotype(s) above, activities of one or more of the 
sulfur responsive genes or gene products can be modulated and tested by screening 
for the desired trait. Specifically, the gene, mRNA levels, or protein levels can be 
altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed. As an example, a plant can be transformed according to Bechtold and 
Pelletier (1998, Methods. Mol. Biol. 82:259-266) and visually inspected for the 
desired phenotype or metabolically and/or functionally assayed according to Saito et 
al. (1994, Plant Physiol. 106: 887-95), Takahashi et al (1997, Proc. Natl. Acad. Sci. 
USA 94: 1 1 102-07) and Koprivova et al. (2000, Plant Physiol. 122: 737-46). 

V.P.2. USE OF SULFUR-RESPONSIVE GENES, GENE COMPONENTS 
AND PRODUCTS TO MODULATE BIOCHEMICAL ACTIVITIES 

The activities of one or more of the sulfur responsive genes can be modulated 
to change biochemical or metabolic activities and/or pathways such as those noted 
below. Such biological activities can be measured according to the citations included 
in the Table below: 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 




Growth , 


. Root 


Klein and Klein (1988) Mineral 


Differentiation and 


• Leaf 


Nutrition, In CM Wilson and J 


Development 


Stem 


Gregory, eds Fundamentals of 




• Chloroplast/Mitochondria 


Plant Science. Harper and Row 




development/differentiation 


Publishers, Inc., NY, pi 63 






Rost et al. (1984) The Absorption 






and Transport System, In R Bern, 






ed, Botany- A Brief Introduction to 






Plant Biology. John Wiley and 






Sons, NY, p96. 






Huluigue et al. (2000) Biochem 






Biophys Res Commun 271: 380-5 






Kapazoglou et al. (2000) Eur J 






Biochem 267: 352-60 




Seed storage protein 


Kim et al.(1999) 209: 282-9 




synthesis 




Metabolisms 


• Sulfate uptake and transport 


Takahashi et al. (1997) Proc Natl 






Acad SciUSA94: 11102-07 




Cysteine Biosynthesis 


Saito et al. (1992) Proc Natl Acad 






Sci USA 89: 8078-82 






Hesse et al. (1999) Amino Acids 






16: 113-31 




♦ Methionine biosynthesis 


Bourgis et al. (1999) Plant Cell 1 1 : 






1485-98 




• Carbon dioxide fixation in 


Buchana (1991) Arch Biochem 




photosynthesis 


Biophys 288: 1-9 




Thioredoxin reduction 


Leustek and Saito (1999) Plant 






Phyiol 120: 637-43 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 




METABOLIC ACTIVITIES 


ASSAYS 




AND/OR PATHWAYS 








Mamedova et al. (1999) FEBS 






Lett 462: 421-4 




Nitrogen metabolism 


Koprivova et al. (2000) Plant 






Physiol. 122: 737-46 






Yamaguchi et al. (1999) Biosci 






Biotechnol Biochem 63: 762-6 


Plant Defenses 


Reduction of oxidative 


May et al. (1998) J Expt Bio 49: 




stress - oxygen metabolism 


649-67 




and reactive oxygen species 


Kreuz et al. (1996) Plant Physiol 




• Detoxification of toxins, 


111:349-53 




xenobiotics and heavy 


Zhao et al. (1998) Plant Cell 10: 




metals 


359-70 




• Defense against pathogens 


Kyung and Fleming (1997) J Food 




or microbes 


Prot60: 67-71 




. Disease prevention by 


Fahey et al. (1997) Proc Natl Acad 




secondary sulfur-containing 


Sci USA 94: 10367-72 




compounds 






• Activation of kinases and 


Davis et al. (1999) Plant Cell 11: 




phosphatases 


1179-90 



Other biological activities that can be modulated by the sulfur responsive genes and 
gene products are listed in the REFERENCE Table. Assays for detecting such biological 
activities are described in the Protein Domain table. 

Sulfur responsive genes are characteristically differentially transcribed in response to 
fluctuating sulfur levels or concentrations, whether internal or external to an organism or cell. 
MA_diff table reports the changes in transcript levels of various sulfur responsive genes. 

Profiles of these different sulfur responsive genes are shown in the Table 

below with examples of associated biological activities. 
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TRANSCRIPT 


TYPE OF GENES 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSEQUENCES 


BIOCHEMICAL 








ACTIVITY 


Up regulated 


Responders.to sulfur 


• Sulfur perception 


• Transporters 


transcripts 


Application 


• Sulfur uptake and 


. Metabolic enzymes 






transport 


Change in cell 






Sulfur metabolism 


membrane structure 






Synthesis of 


and potential 






secondary 


• Kinases and 






metabolites and/or 


phosphatases 






proteins 


• Transcription 








activators 






• Modulation of 


• Change in chromatin 






sulfur response 


structure and/or 






transduction 


localized DNA 






pathways 


topology 






• Specific gene 


Redox control 






transcription 








initiation 




Down-regulated 


responder to sulfur 


• Negative 


• Transcription factors 


transcripts 


repressors of sulfur 


regulation of 


Change in protein 




state of metabolism 


sulfur pathways 


structure by 








phosphorylation 




Genes with 


• Changes in 


(kinases) or 




discontinued 


pathways and 


dephosphoryaltion 




expression or 


processes 


(phosphatases) 




unsTable mRNA in 


operating in cells 


• Change in chromatin 




presence of sulfur 


Changes in other 


structure and/or DNA 






metabolisms than 


topology 






sulfur 


• Stability of factors for 








protein synthesis and 








degradation 
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TRANSCRIPT 
LEVELS 


TYPE OF GENES 


PHYSIOLOGICAL 
CONSEQUENCES 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 








• Metabolic enzymes 



Use of Promoters of Sulfur Responsive Genes 

Promoters of Sulfur responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Sulfur responsive 
genes where the desired sequence is operably linked to a promoter of a Sulfur responsive 
gene. The protein product of such a polynucleotide is usually synthesized in the same cells, 
in response to the same stimuli as the protein product of the gene from which the promoter 
was derived. Such promoter are also useful to produce antisense mRNAs to down-regulate 
the product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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V.Q. ZINC RESPONSIVE GENES, GENE COMPONENTS AND PRODUCTS 
Phytoremediation of soils contaminated with toxic levels of heavy metals requires the 
understanding of plant metal transport and tolerance. The numerous Arabidopsis thaliana 
studies have given scientists the potential for dissection and elucidation of plant 
micronutrient/heavy metal uptake and accumulation pathways. It has been shown altered 
regulation of ZNT1, a Zn/Cd transporter, contributes to high Zn uptake. Isolation and 
characterization of Zn/Cd hyperaccumulation genes may allow expression in higher biomass 
plant species for efficient contaminated soil clean up. Identification of additional Zn 
transport, tolerance and nutrition-related genes involved in heavy metal accumulation will 
enable manipulation of increased uptake (for phytoremediation) as well as limitation of 
uptake or leak pathways that contribute to toxicity in crop plants. Additionally, Zn-binding 
ligands involved in Zn homeostasis or tolerance may be identified, as well as factors affecting 
the activity or expression of Zn binding transcription factors. Gene products acting in concert 
to effect Zn uptake, which would not have been identified in complementation experiments, 
including multimeric transporter proteins, could also be identified. 

Microarray technology allows monitoring of gene expression levels for thousands of 
genes in a single experiment. This is achieved by simultaneously hybridizing two 
differentially labeled fluorescent cDNA pools to glass slides that contain spots of DNA 
(Schena et al. (1995) Science 270: 467-70). The Arabidopsis Functional Genomics 
Consortium (AFGC) has recently made public the results from such microarray experiments 
conducted with AFGC chips containing 10,000 non-redundant ESTs, selected from 37,000 
randomly sequenced ESTs generated from mRNA of different tissues and developmental 
stages. 

The sequences of the ESTs showing at least two-fold increases or decreases over the 
controls were identified, compared to the Ceres full-length cDNA and genomic sequence 
databanks, and identical Ceres clones identified. The Zn response information was then used 
in conjunction with the existing annotation to attribute biological function or utility to the 
full-length cDNA and corresponding genomic sequence. 

The MA_diff Table(s) reports the transcript levels of the experiment (see EXPT ID: 
Zinc (relating to SMD 7310, SMD 731 1)). For transcripts that had higher levels in the 
samples than the control, a "+" is shown. A "-" is shown for when transcript levels were 
reduced in root tips as compared to the control. For more experimental detail see the 
Example section below. 
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Zinc genes are those sequences that showed differential expression as compared to 
controls, namely those sequences identified in the MA_diff tables with a "+" or "-" indication 

Zinc Genes Identified By Cluster Analyses Of Differential Expression 

Zinc Genes Identified By Correlation To Genes That Are Differentially 

Expressed 

As described above, the transcription profiles of genes that act together are 
well correlated. Applicants not only have identified the genes that are differentially 
expressed in the microarray experiments, but also have identified the genes that act in concert 
with them. The MA_clust table indicates groups of genes that have well correlated 
transcription profiles and therefore participate in the same pathway or network. 

A pathway or network of Zinc genes is any group in the MA_clust that 
comprises a cDNA ID that also appears in Expt ID Zinc (relating to SMD 7310, SMD 731 1) 
oftheMA_difftable(s). 

Zinc Genes Identified By Correlation To Genes That Cause Physiological 

Consequences 

Additionally, the differential expression data and the phenotypic observations 
can be merged to identify pathways or networks of Zinc genes. A group in the MA_clust is 
considered a Zinc pathway or network if the group comprises a cDNA ID that also appears in 
Knock-in or Knock-out tables that causes one or more of the phenotypes described in section 
above. 

Zinc Genes Identified By Amino Acid Sequence Similarity 

Zinc genes from other plant species typically encode polypeptides that share 
amino acid similarity to the sequences encoded by corn and Arabidopsis Zinc genes. Groups 
of Zinc genes are identified in the Protein Group table. In this table, any protein group that 
comprises a peptide ID that corresponds to a cDNA ID member of a Zinc pathway or network 
is a group of proteins that also exhibits Zinc functions/utilities. 
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V.Q.I. USE OF ZN TRANSPORT, TOLERANCE AND NUTRITION- 
RELATED GENES TO MODULATE PHENOTYPES 

Changes in zinc concentration in the surrounding environment or in contact with a plant 
results in modulation of many genes and gene products. Examples of such zinc responsive genes 
and gene products are shown in the Reference, Sequence tables, Protein Group, Protein Group 
Matrix, MA_diff, and MA_clust tables. These genes and/or products are responsible for effects 
on traits such as plant vigor and seed yield. 

While zinc responsive polynucleotides and gene products can act alone, combinations 
of these polynucleotides also affect growth and development. Useful combinations include 
different zinc responsive polynucleotides and/or gene products that have similar transcription 
profiles or similar biological activities, and members of the same or similar biochemical 
pathways. In addition, the combination of a zinc responsive polynucleotide and/or gene 
product with another environmentally responsive polynucleotide is also useful because of the 
interactions that exist between hormone regulated pathways, stress pathways, nutritional 
pathways and development. Here, in addition to polynucleotides having similar transcription 
profiles and/or biological activities, useful combinations include polynucleotides that may 
have different transcription profiles but which participate in a common pathway. 

Such zinc responsive genes and gene products can function to either increase or 

dampen the above phenotypes or activities either 
in response to changes in zinc concentration or 

• in the absence of zinc fluctuations. 

Zn transport, tolerance and nutrition-related genes and gene products can be used 
to alter or modulate one or more of the following phenotypes: 

• Zn Uptake 

• Transport of Zn or other heavy metals into roots 

• Epidermal/cortical uptake 

• Xylem loading 

Zn compartmentation 

• Xylem unloading 

• Phloem loading 

• Efflux from cells to apoplast 

• Sequestration in vacuoles/subcellular compartments. 
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• Zn tolerance 

• Chelation of Zn 

• Transport of Zn 

• Metabolic and transcriptional control 
. Activity of Zn binding enzymes 

• Activity of Zn binding transcription factors 

To improve any of the phenotype(s) above, activities of one or more of the Zn transport, 
tolerance and nutrition-related genes or gene products can be modulated and the plants can be 
tested by screening for the desired trait. Specifically, the gene, mRNA levels, or protein levels 
can be altered in a plant utilizing the procedures described herein and the phenotypes can be 
assayed, for example, in accordance to Lasat MM, Pence NS, Garvin DF, Ebbs SD, Kochian 
LV. J Exp Bot. 2000 Jan;51(342):71-9; Grotz N, Fox T, Connolly E, Park W, Guerinot ML, 
Eide D. Proc Natl Acad Sci USA. 1998 Jun 9;95(12):7220-4; Crowder MW, Maiti MK, 
Banovic L, Makaroff CA. FEBS Lett. 1997 Dec l;418(3):351-4; Hart JJ, Norvell WA, Welch 
RM, Sullivan LA, Kochian LV. Plant Physiol. 1998 Sep;118(l):219-26. 

V.Q.2. USE OF ZN TRANSPORT, TOLERANCE AND NUTRITION- 
RELATED GENES TO MODULATE BIOCHEMICAL ACTIVITIES 
Alternatively, the activities of one or more of the zinc responsive genes can be 



modulated to change biochemical or metabolic activities and/or pathways such as 
those noted below. Such biological activities can be measured according to the 
citations included in the Table below: 



PROCESS 


BIOCHEMICAL OR 
METABOLIC 
ACTIVITIES AND/OR 
PATHWAYS 


CITATIONS INCLUDING 
ASSAYS 


Zn Uptake and 
Assimilation 


Zn Influx 


Lasat MM, Pence NS, Garvin DF, 
Ebbs SD, Kochian LV. J Exp Bot. 
2000 Jan;51(3.42):71-9. 




Zn compartmentation 


Hart JJ, Norvell WA, Welch RM, 
Sullivan LA, Kochian LV. Plant 
Physiol. 1998 Sep;118(l):219-26. 
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PROCESS 


BIOCHEMICAL OR 


CITATIONS INCLUDING 






METABOLIC 


ASSAYS 






ACTIVITIES AND/OR 








PATHWAYS 






Zn binding by metabolic 




Crowder MW, Maiti MK, Banovic 




enzymes 




L, Makaroff CA.FEBS Lett. 1997 








Dec l;418(3):351-4; Kenzior AL, 








Folk WR. FEBS Lett. 1998 Dec 








4;440(3):425-9. 




Zn binding by 




Newton AL, Sharpe BK, Kwan A, 




transcription factors 




Mackay JP, Crossley M. J Biol 








Chem. 2000Mayl9;275(20):15128- 








34; Lopez Ribera I, Ruiz-Avila L, 








Puigdomenech P. Biochem Biophys 


iS * 

■BBS* 






Res Commun. 1997 Jul 








18;236(2):510-6;de Pater S, Greco 


ru 






V, Pham K, Memelink J, Kijne J. 


ffi 






Nucleic Acids Res. 1996 Dec 


c : 

m 






1;24(23):4624-31. 




Synthesis of proteins to 




Schafer HJ, Greiner S, Rausch T, 


J=«. 

r : 


chelate Zn and other 




Haag-Kerwer A. FEBS Lett. 1997 


N : 


metals 




Mar 10;404(2-3):216-20. 








Rauser WE.Cell Biochem Biophys. 








1999;31(l):19-48. 




Synthesis of metabolites 




Rauser WE.Cell Biochem Biophys. 




to chelate Zn and other 




1999;31(l):19-48. 




metals 







Other biological activities that can be modulated by Zn transport, tolerance and nutrition- 
related genes and their products are listed in the Reference tables. Assays for detecting such 
biological activities are described in the Protein Domain table. 
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Zn transport, tolerance and nutrition-related genes are differentially transcribed in 
response to low Zn concentrations. The microarray comparison consists of probes prepared 
from root RNA of A. thaliana (Columbia) seedlings hydroponically grown in complete 
nutrient medium (control) and Zn deficient seedlings grown in -Zn nutrient medium 
(experimental). The data from this experiment reveal a number of types genes and gene 
products. MA_diff table reports the changes in transcript levels of various zinc responsive 
genes in entire seedlings at 1 and 6 hours after a plant was sprayed with a Hoagland's 
solution enriched with zinc as compared to seedlings sprayed with Hoagland's solution only. 

The data from this time course can be used to identify a number of types of zinc 
responsive genes and gene products, including "early responding," "high zinc responders," 
"repressors of zinc deprivation pathways" and "zinc deprivation responders." Profiles of 
these different zinc responsive genes are shown in the Table below with examples of 
associated biological activities. 



TRANSCRIPT 


TYPE OF GENE 


PHYSIOLOGICAL 


EXAMPLES OF 


LEVELS 




CONSequence 


BIOCHEMICAL 








ACTIVITY 


Upregulated 


Early responders to 


- Zinc Perception 


-Transcription 


transcripts (level at 


Zinc 


-Zinc Uptake 


Factors 


1 hour = 6 hours) 




- Modulation of Zinc 


-Transporters 


(level at 1 hour > 6 




Response Transduction 




hours) 




Pathways 








- Specific Gene 








Transcription Initiation 






Zinc Deprivation 




-Inhibit Transport of 




Responders 


-Repression of Pathways 


Zinc 






to Optimize Zinc 


-Degradation 






Response Pathways 




Level at 1 hour < 6 


Delayed Zinc 




-Zinc Metabolic 


hours 


Responders 




Pathways 




Repressor of Zinc 


Negative Regulation of 






Deprivation Pathways 


Zinc Pathways 
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TRANSCRIPT 
LEVELS 


TYPE OF GENE 


PHYSIOLOGICAL 
CONSequence 


EXAMPLES OF 
BIOCHEMICAL 
ACTIVITY 


Down Regulated 
transcripts (Level 
at 1 hour = 6 
hours) 

(Level at 6 hours > 
1 hour) 


Early responder 
repressors of Zinc 
utilization Pathways 


Negative Regulators of 
Zinc Utilization Pathways 


Suppressing Zinc 
Requiring processes 


Level at 1 hour > 6 
hours 


Genes with 
discontinued 
expression or 
unsTable mRNA 
following Zinc uptake 


Changes in pathways and 
processes operating I cells 





Use of Promoters of Zinc Responsive Genes 

Promoters of Zinc responsive genes are useful for transcription of any desired 
polynucleotide or plant or non-plant origin. Further, any desired sequence can be transcribed 
in a similar temporal, tissue, or environmentally specific patterns as the Zinc responsive 
genes where the desired sequence is operably linked to a promoter of a Zinc responsive gene. 
The protein product of such a polynucleotide is usually synthesized in the same cells, in 
response to the same stimuli as the protein product of the gene from which the promoter was 
derived. Such promoter are also useful to produce antisense mRNAs to down-regulate the 
product of proteins, or to produce sense mRNAs to down-regulate mRNAs via sense 
suppression. 
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VI. UTILITIES OF PARTICULAR INTEREST 

Genes capable of modulating the phenotypes in the following table are useful produce 
the associated utilities in the table. Such genes can be identified by their cDNA ED number in 
the Knock-in and Knock-out tables. 



Phenotype 
Modulated by a 
Gene 

Leaf shape Cordate 

Cup-shaped 
Curled 

Laceolate 

Lobed 

Oval 

Ovate 

Serrate 

Trident 
Undulate 

Vertically Oblong 

Other Shapes 



Long petioles 
Short petioles 



Utilities 



decrease wind opacity, 

decrease lodging (plant fall over), 

increase biomass by making larger or 

different shaped leaves, 

improve the efficiency of mechanical 

harvesting, 

decrease transpiration for better drought 
tolerance, 

changing leaf shape to collect and absorb 
water, 

modulation of canopy structure and shading 
for altered irradiance close to the ground, 

enhanced uptake of pesticides (herbicides, 
fungicides, etc), 

creation of ornamental leaf shapes, 
increase resistance to pathogens by 
decreasing amount of water that collects on 
leaves, 

change proporation of cell types in the 
leaves for enhanced photosynthesis, 
decreased transpiration, and enhanced 

accumulation of desirable compounds 
including secondary metabolites in 
specialized cells, 
decrease insect feeding, 

decrease wind opacity, 

decrease lodging (plant fall over), 

increase biomass by better positioning of 

the leaf blade, 

decrease insect feeding, 

decrease transpiration for better drought 

tolerance, 

position leaves most effectively for 
photosynthetic efficiency 



Fused 



ornamental applications to make distinctive 
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Phenotype 
Modulated by a 
Gene 



Reduced fertility 



Short siliques 



Reduced fertility 
Sterility 



Flower size 



Stature 



Large 
Small 



Dwarfs 



Meristems 
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Utilities 



| plants, 



increase or decrease the number of seeds in 
a fruit, 

increasing fruit size, 

modulating fruit shape to better fit 

harvesting or packaging requirements, 

useful for controlling dehisence and seed 

scatter 

useful in hybrid breeding programs, 
increasing fruit size, 
production of seedless fruit, 
useful as targets for gametocides, 
modulating fruit shape to better fit 
harvesting or packaging requirements, 
useful for controlling dehisence and seed 
scatter 

useful for edible flowers 

useful for flower derived products such as 

fragrances 

useful for modulating seed size and number 
in combination with seed-specific genes 

value in the ornamental industry 

increasing or decreasing plant biomass, 
optimizing plant stature to increase yield 
under various diverse environmental 
conditions, e.g., when water or nutrients 

are limiting, 
decreasing lodging, 
increasing fruit number and size, 
controlling shading and canopy effects 

Change plant architecture, 

increase or decrease number of leaves as 

well as change the types of leaves to 

increase biomass, 

improve photosynthetic efficiency, 

create new varieties of ornamental plants 

with enhanced leaf design, 
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Phenotype 
Modulated by a 
Gene 



Stem 



Strong 



Weak 



Late/Early Bolting 
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Utilities 



preventing flowering to opimize vegetative 
growth, 

control of apical dominace, 
increase or decrease flowering time to fit 
season, water or fertilizer schedules, 
change arrangement of leaves on the stem 
(phyllotaxy) to optimize plant density, 
decrease insect feeding, 

or decrease pathogen infection, 
increase number of trichome/glandular 
trichome producing leaves 
targets for herbicides, 
generate ectopic meristems and ectopic 
growth of vegetative and floral tissues and 
seeds and fruits 

modify lignin content/composition for 
creation of harder woods or reduce 
difficulty/costs in pulping for 

paper production or increase 
digestibility of forage crops, 
decrease lodging, 

modify cell wall polysaccharides in stems 
and fruits for improved texture and 
nutrition, 
increase biomass 

Break the need for long vernalization of 
vernalization-dependent crops, e.g., winter 
wheat, thereby increasing yield 
decrease or increase generaton time 
increase biomass 



Lethals 



Embryo-lethal 



Embryo-defective 



produce seedless fruit, 
use as herbicide targets 

produce seedless fruit, 
use as herbicide targets 



Seedling 



use as herbicide targets, 

useful for metabolic engineering, 



Pigment-lethals 



use as herbicide targets, 
increase photosynthetic efficiency 
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Phenotype 
Modulated by a 
Gene 



• 
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Utilities 



Pigment 



Dark Green 



YGV1 



YGV2 



6 



YGV3 



Interveinal chlorosis 



Roots 



Short (primary root) 



Thick (primary root) 



Increase nutritional value, 

enhanced photosynthesis and carbon 

dioxide combustion and therefore increase 

plant vigor and biomass, 

enhanced photosynthetic efficiency and 

therefore increase plant vigor and biomass, 

prolong vegetative development, 
enhanced protection against pathogens, 

Useful as targets for herbicides, 
increase photosynthetic efficiency and 
therefore increase plant vigor and biomass, 

Useful as targets for herbicides, 

control of change from embryonic to adult 

organs, 

increase metabolic efficiency, 
increase photosynthetic efficiency and 
therefore increased plant vigor and biomass, 

Useful as targets for herbicides, 
nitrogen sensing/uptake/usage, 
increase metabolic efficiency and therefore 
increased plant vigor and biomass, 

to increase photosynthetic efficiency and 
therefore increase plant vigor and biomass 

to increase or decrease nitrogen transport 
and therefore increase plant vigor and 
biomass 

use as herbicide targets 
increase metabolic efficiency, 

to access water from rainfall, 

to access rhizobia spray application, for 

anaerobic soils, 

useful to facilitate harvest of root crops, 

useful for increasing biomass of root crops, 
for preventing plants dislodging during 
picking and harvesting, 
as root grafts, for animal feeds 
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Modulated by a 
Gene 
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Utilities 



Branching (primary 
root) 



Long (lateral roots) 



m 

nj 



Agravitropic 

Curling (primary root) 



Trichome 



Poor germination 

Reduced Number 
Glabrous 

Increased Number 



Wax mutants 



modulation allows betters access to water, 
minerals, fertilizers, rhizobia prevent soil 
erosion,s 

increasing root biomass 
decrease root lodging, 

modulation allows improved access to 

water, nutrients, fertilizer, rhizobia, prevent 

soil erosion 

increase root biomass 

decrease root lodging 

modulation allows control on the depth of 

root growth in soil to access water and 

nutriennts 

modulation allows hormonal control of root 
growth and development (size) 

modulation allows control on the depth of 
root growth in soil 

modulation allows hormonal control of root 

growth and development (size) 

useful in anaerobic soils in allowing roots 

to stay close to surface 

harvesting of root crops 



Genes useful for decreasing transpiration, 
increased production of glandular trichomes 
for oil or other secreted chemicals of value, 

use as deterrent for insect herbivory and 
ovipostion 

modulation will increase resistance to UV 
light, 

decrease insect herbivory and oviposition, 
compostion changes for the cosmetics 
industry, 

decrease transpiration, 
provide pathogen resistance, 
UV protection, 

modulation of leaf runoff properties and 
improved access for herbicides and 
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Phenotype 
Modulated by a 
Gene 



Cotyledons 



Seeds 



Flowers 
Hypocotysl 



Transparent testa 

Light 
Dark 

Other 

Long 
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Utilities 



fertilizers 



modulation of seeds structure in legumes, 

increase nutritional value, 

improve seedling competion under field 

conditions, 

genes useful for metabolic engineering 
anthocyanin and flavonoid pathways 
improved nutritional content 

decrease petal abscission 
decrease pod shattering 

to improve germination rates 
to improve plant survivability 



Short 



to improve germination rates 
to improve plant survivability 
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VII. ENHANCED FOODS 

Animals require external supplies of amino acids that they cannot synthesize 
themselves. Also, some amino acids are required in larger quantities. The nutritional values 
of plants for animals and humans can thus be modified by regulating the amounts of the 
constituent amino acids that occur as free amino acids or in proteins. For instance, higher 
levels of lysine and/or methionine would enhance the nutritional value of corn seed. 
Applicants herein provide several methods for modulating the amino acid content: 

(1) expressing a naturally occurring protein that has a high percentage of the 
desired amino acid(s); 

(2) expressing a modified or synthetic coding sequence that has an enhanced 
percentage of the desired amino acids; or 

(3) expressing the protein(s) that are capable of synthesizing more of the desired 
amino acids. 

A specific example is expressing proteins with enhanced, for example, methionine content, 
preferentially in a corn or cereal seed used for animal nutrition or in the parts of plants used 
for nutritional purposes. 

A protein is considered to have a high percentage of an amino acid if the amount of 
the desired amino acid is at least 1% of the total number of residues in a protein; more 
preferably 2% or greater. Amino acids of particular interest are tryptophan, lysine, 
methionine, phenylalanine, threonine leucine, valine, and isoleucine. Examples of naturally 
occurring proteins with a high percentage of any one of the amino acid of particular interest 
are listed in the Enhanced Amino Table. 

The sequence(s) encoding the selected protein(s) are operably linked to a promoter 
and other regulatory sequences and transformed into a plant as described below. The 
promoter is chosen optimally for promoting the desired level of expression of the protein in 
the selected organ e.g. a promoter highly functional in seeds. Modifications may be made to 
the sequence encoding the protein to ensure protein transport into, for example, organelles or 
storage bodies or its accumulation in the organ. Such modifications may include addition of 
signal sequences at or near the N terminus and amino acid residues to modify protein stability 
or appropriate glycosylation. Other modifications may be made to the transcribed nucleic 
acid sequence to enhance the stability or translatability of the mRNA, in order to ensure 
accumulation of more of the desired protein. Suitable versions of the gene construct and 
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transgenic plants are selected on the basis of, for example, the improved amino acid content 
and nutritional value measured by standard biochemical tests and animal feeding trials. 
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VHI USE OF NOVEL GENES TO FACILITATE EXPLOITATION OF PLANTS AS 
FACTORIES FOR THE SYNTHESIS OF VALUBLE MOLECULES 

Plants and their constituent cells, tissues, and organs are factories that manufacture 
small organic molecules such as sugars, amino acids, fatty acids, vitamins, etc., as well as 
macromolecules such as proteins, nucleic acids, oils/fats and carbohydrates. Plants have 
long been a source of pharmaceutically beneficial chemicals; particularly, the secondary 
metabolites and hormone-related molecules synthesized by plants. Plants can also be used as 
factories to produce carbohydrates or lipids that comprises a carbon backbone useful as 
precursors of plastics, fiber, fuel, paper, pulp, rubber, solvents, lubricants, construction 
materials, detergents, and other cleaning materials. Plants can also generate other compounds 
that are of economic value, such as dyes, flavors, and fragrances. Both the intermediates as 
well as the end-products of plant bio-synthetic pathways have been found useful. 
With the polynucleotides and polypeptides of the instant invention, modification of both in- 
vitro and in- vivo synthesis of such products is possible. One method of increasing the 
amount of either the intermediates or the end-products synthesized in a cell is to increase the 
expression of one or more proteins in the synthesis pathway as discussed below. Another 
method of increasing production of an intermediate is to inhibit expression of protein(s) that 
synthesize the end-product from the intermediate. Levels of end-products and intermediates 
can also be modified by changing the levels of enzymes that specifically change or degrade 
them. The kinds of molecules made can be also be modified by changing the genes encoding 
specific enzymes performing reactions at specific steps of the biosynthetic pathway. These 
genes can be from the same or a different organism. The molecular structures in the 
biosynthetic pathways can thus be modified or diverted into different branches of a pathway 
to make novel end-products. 

Novel genes comprising selected promoters and sequences encoding enzymes are 
transformed into the selected plant to modify the levels, composition and/or structure of, 
without limitation: 

Terpenoids 
Alkaloids 

Hormones, including brassinosteriods 

Flavonoids 

Steroids 
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Vitamins such as 

Retinol 

Riboflavin 

Thiamine 
Caffeine 

Morphine and other alkaloids 
Peptides and amino acid synthesis 
Antioxidants 
Starches and lipids 
Fatty acids 

Fructose, mannose and other sugars 

Glycerolipid 

Citric acid 

Lignin 

Flavors 

Fragrances 

Essential oils 

Colors or dyes 

Gum 

Gels 

Waxes 



The modifications are made by designing one or more novel genes per application 
comprising promoters, to ensure production of the enzyme(s) in the relevant cells, in the right 
amount, and polynucleotides encoding the relevant enzyme. The promoters and 
polynucleotides are the subject of this application. The novel genes are transformed into the 
relevant species using standard procedures. Their effects are measured by standard assays for 
the specific chemical/biochemical products. 

These polynucleotides and proteins of the invention that participate in the relevant 
pathways and are useful for changing production of the above chemicals and biochemicals 
are identified in the Reference tables by their enzyme function. More specifically, proteins of 
the invention that have the following enzymatic activities are of interest to modulate the 



Page 573 of 772 



Attorney Dkt. 

Client Dkt. 80090.005 



corresponding pathways to produce precursors or final products noted above that are of 
industrial use. Biological activities of particular interest are listed below. 

Other polynucleotides and proteins that regulate where, when and to what extent a 
pathway is active in a plant are extremely useful for modulating the synthesis and 
accumulation of valuable chemicals. These elements including transcription factors, proteins 
involved in signal transduction and other proteins in the control of gene expression are 
described elsewhere in this application. 
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Pathway 


Enzyme 


Comments 


Name 


Description 




Alkaloid 
biosynthesis I 


Morphine 6- 
dehydrogenase 


Also acts on other alkaloids, including 
codeine, normorphine and 
ethylmorphine, but only very slowly 
on 7,8-saturated derivatives such as 
dihydromorphine and dihydrocodeine 
In the reverse direction, also reduces 
naloxone to the 6-alpha- hydroxy 
analog Activated by 2- 
mercaptoethanol 




Codeinone 

reductase 

(NADPH) 


Stereospecifically catalyses the 
reversible reduction of codeinone to 
codeine, which is a direct precursor of 
morphine in the opium poppy plant, 
Papaver somniferum 




Salutaridine 

reductase 

(NADPH) 


Stereospecifically catalyses the 
reversible reduction of salutaridine to 
salutaridinol, which is a direct 
precursor of morphinan alkaloids in 
the poppy plant, Papaver somniferum 




(S)-stylopine 
synthase 


Catalyses an oxidative reaction that 
does not incorporate oxygen into the 
product Forms the second 
methylenedioxy bridge of the 
protoberberine alkaloid stylopine from 
oxidative ring closure of adjacent 
phenolic and methoxy groups of 
cheilanthifoline 




(S)-cheilanthifoline 
synthase 


Catalyses an oxidative reaction that 
does not incorporate oxygen into the 
product Forms the methylenedioxy 
bridge of the protoberberine alkaloid 
cheilanthifoline from oxidative ring 
closure of adjacent phenolic and 
methoxy groups of scoulerine 




Salutaridine 
synthase 


Forms the morphinan alkaloid 
salutaridine by intramolecular phenol 
oxidation of reticuline without the 
incorporation of oxygen into the 
product 




(S)-canadine 
synthase 


Catalyses an oxidative reaction that 
does not incorporate oxygen into the 
product Oxidation of the 
methoxyphenol group of the alkaloid 
tetrahydrocolumbamine results in the 
formation of the methylenedioxy 
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bridge of canadine 




Protopine 6- 
monooxygenase 


Involved in benzophenanthridine 
alkaloid synthesis in higher plants 




Dihydrosanguinarin 
e 10- 

monooxygenase 


Involved in benzophenanthridine 
alkaloid synthesis in higher plants 




Monophenol 
monooxygenase 


A group of copper proteins that also 
catalyse the reaction of EC 1.10.3.1, if 
only 1,2-benzenediols are available as 
substrate 




L-amino acid 
oxidase 






1,2- 

dehydroreticuliniu 
m reductase 
(NADPH) 


Stereospecifically reduces the 1,2- 
dehydroreticulinium ion to (R)- 
reticuline, which is a direct precursor 
of morphinan alkaloids in the poppy 
plant, papaver somniferum The 
enzyme does not catalyse the reverse 
reaction to any significant extent 
under physiological conditions 




Dihydrobenzophena 
nthridine oxidase 


Also catalyzes: dihydrochelirubine + 
0(2) = chelirubine + H(2)0(2) Also 
catalyzes: dihydromacarpine + 0(2) = 
macarpine + H(2)0(2) Found in 
higher plants Produces oxidized forms 
of the benzophenanthridine alkaloids 




Reticuline oxidase 


The product of the reaction, (S)- 
scoulerine, is a precursor of protopine, 
protoberberine and 
benzophenanthridine alkaloid 
biosynthesis in plants Acts on (S)- 
reticuline and related compounds, 
converting the N- methyl group into 
the methylene bridge ('berberine 
bridge[PRIME]) of(S)- 
tetrahydroprotoberberines 




3[PRIME]- 

hydroxy-N-methyl- 

(S)-coclaurine 

4[PRIME]-0- 

methyltransferase 


Involved in isoquinoline alkaloid 
metabolism in plants Has also been 
shown to catalyse the methylation of 
(R,S)- laudanosoline, (S)-3[PRIME]- 
hydroxycoclaurine and (R,S)-7-0- 
methylnoraudanosoline 




(S)-scoulerine 9-0- 
methyltransferase 


The product of this reaction is a 
precursor for protoberberine alkaloids 
in plants 
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Columbamine 0- 
methyltransferase 


The product of this reaction is a 
protoberberine alkaloid that is widely 
distributed in the plant kingdom 
Distinct in specificity from EC 
2.1.1.88 




10- 

hydroxydihydrosan 
guinarine 10-O- 
methyltransferase 


Part of the pathway for synthesis of 
benzophenanthridine alkaloids in 
plants 




12- 

hydroxydihydrochel 
irubine 12-0- 
methyltransferase 


Part of the pathway for synthesis of 
benzophenanthridine alkaloid 
macarpine in plants 




(R,S)-norcoclaurine 
6-0- 

methyltransferase 


Norcoclaurine is 6,7-dihydroxy-l-[(4- 
hydroxyphenyl)methyl]- 1 ,2,3 ,4- 
tetrahydroisoquinoline The enzyme 
will also catalyse the 6-O-methylation 
of (R,S)- norlaudanosoline to form 6- 
O-methyl-norlaudanosoline, but this 
alkaloid has not been found to occur in 
plants 




Salutaridinol 7-0- 
acetyltransferase 


At higher pH values the product, 7-0- 
acetylsalutaridinol, spontaneously 
closes the 4->5 oxide bridge by allylic 
elimination to form the morphine 
precursor thebaine From the opium 
poppy plant, Papaver somniferum 




Aspartate 
aminotransferase 


Also acts on L-tyrosine, L- 
phenylalanine and L-tryptophan. This 
activity can be formed from EC 
2.6.1.57 by controlled proteolysis 




Tyrosine 
aminotransferase 


L-phenylalanine can act instead of L- 
tyrosine The mitochondrial enzyme 
may be identical with EC 2.6.1.1 The 
three isoenzymic forms are 
interconverted by EC 3.4.22.4 




Aromatic amino 
acid transferase 


L-methionine can also act as donor, 
more slowly Oxaloacetate can act as 
acceptor Controlled proteolysis 
converts the enzyme to EC 2.6.1.1 




Tyrosine 
decarboxylase 


The bacterial enzyme also acts on 3- 
hydroxytyrosine and, more slowly, on 
3-hydroxyphenylalanine 




Aromatic-L-amino- 
acid decarboxylase 


Also acts on L-tryptophan, 5-hydroxy- 
L-tryptophan and dihydroxy- L- 
phenylalanine (DOPA) 


Alkaloid 

biosynthesis 

II 


Tropine 
dehydrogenase 


Oxidizes other tropan-3-alpha-ols, but 
not the corresponding beta- 
derivatives 
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Tropinone 
reductase 






Hyoscyamine (6S)- 
dioxygenase 






6-beta- 

hydroxyhyoscyami 
ne epoxidase 






Amine oxidase 
(copper-containing) 


A group of enzymes including those 
oxidizing primary amines, diamines 
and histamine One form of EC 
1.3.1.15 from rat kidney also catalyses 
this reaction 




Putrescine N- 
methyltransferase 






Ornithine 
decarboxylase 






Oxalyl-CoA 
decarboxylase 






Phenylalanine 
ammonia-lyase 


May also act on L-tyrosine 


Androgen and 

estrogen 

metabolism 


3-beta-hydroxy- 
delta(5)-steroid * 
dehydrogenase 


Acts on 3-beta-hydroxyandrost-5-en- 
17-one to form androst-4-ene- 3,17- 
dione and on 3-beta-hydroxypregn-5- 
en-20-one to form progesterone 




11-beta- 

hydroxysteroid 

dehydrogenase 






Estradiol 17-alpha- 
dehydrogenase 


• 




3-alpha-hydroxy-5- 
beta-androstane- 1 7- 
one 3-alpha- 
dehydrogenase 






3-alpha(17-beta)- 
hydroxysteroid 
dehydrogenase 
(NAD+) 


Also acts on other 17-beta- 
hydroxy steroids, on the 3-alpha- 
hydroxy group of pregnanes and bile 
acids, and on benzene dihydrodiol 
Different from EC 1 . 1 . 1 .50 or EC 
1.1.1.213 




3-alpha- 
hydroxysteroid 
dehydrogenase (B- 
specific) ' 


Acts on other 3-alpha-hydroxysteroids 
and on 9-, 11- and 15- 
hydroxyprostaglandin B-specific with 
respect to NAD(+) or NADP(+) (cf. 
EC 1.1.1.213) 




3(or 17)beta- 

hydroxysteroid 

dehydrogenase 


Also acts on other 3 -beta- or 17-beta- 
hydroxysteroids (cf EC 1 . 1 . 1 .209) 
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Estradiol 17 beta- 
dehydrogenase 


Also acts on (S)-20-hydroxypregn-4- 
en-3-one and related compounds, 
oxidizing the (S)-20-group B-specific 
with respect to NAD(P)(+) 




Testosterone 17- 
beta-dehydrogenase 






Testosterone 17- 

beta-dehydrogenase 

(NADP+) 


Also oxidizes 3-hydroxyhexobarbital 
to 3-oxohexobarbital 




Steroid 11-beta- 
monooxygenase 


Also hydroxylates steroids at the 18- 
position, and converts 18- 
hydroxycorticosterone into 
aldosterone 




Estradiol 6-beta- 
monooxygenase 






Androst-4-ene- 

3,17-dione 

monooxygenase 


Has a wide specificity A single 
enzyme from Cylindrocarpon 
radicicola (EC 1.14.13.54) catalyses 
both this reaction and that catalysed 
by EC 1.14.99.4 




3-oxo-5-alpha- 
steroid 4- 
dehy dro genas e 






3 -oxo-5 -beta- 
steroid 4- 
dehydrogenase 






UDP- 

glucuronosyltransfe 
rase 


Family of enzymes accepting a wide 
range of substrates, including phenols, 
alcohols, amines and fatty acids Some 
of the activities catalysed were 
previously listed separately as EC 
2.4.1.42, EC 2.4.1.59, EC 2.4.1.61, EC 
2.4.1.76, EC 2.4.1.77, EC 2.4.1.84, EC 
2.4.1.107 and EC 2.4.1.108 A 
temporary nomenclature for the 
various forms whose delineation is in 
a state of flux 




Steroid 

sulfotransferase 


Broad specificity resembling EC 
2.8.2.2, but also acts on estrone 




Alcohol 
sulfotransferase 


Primary and secondary alcohols, 
including aliphatic alcohols, ascorbate, 
chloramphenicol, ephedrine and 
hydroxysteroids, but not phenolic 
steroids, can act as acceptors (cf. EC 
2.8.2.15) 




Estrone 
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sulfotransferase 






Arylsulfatase 


A group of enzymes with rather 
similar specificities 




Steryl-sulfatase 


Also acts on some related steryl 
sulfates 




1 7-alpha- 

hydroxyprogesteron 
e aldolase 






Steroid delta- 
isomerase 




C21 -Steroid 

hormone 

metabolism 


3-beta-hydroxy- 

delta(5)-steroid 

dehydrogenase 


Acts on 3-beta-hydroxyandrost-5-en- 
17-one to form androst-4-ene- 3,17- 
dione and on 3-beta-hydroxypregn-5- 
en-20-one to form progesterone 




11-beta- 

hydroxysteroid 

dehydrogenase 






20-alpha- 

hydroxysteroid 

dehydrogenase 


A-specific with respect to NAD(P)(+) 




3-alpha- 
hydroxysteroid 
dehydrogenase (B- 
specific) 


Acts on other 3 -alpha-hydroxy steroids 
and on 9-, 11- and 15- 
hydroxyprostaglandin B-specific with 
respect to NAD(+) or NADP(+) (cf. 
EC 1.1.1.213) 




3-alpha(or 20-beta)- 

hydroxysteroid 

dehydrogenase 


The 3-alpha-hydroxyl group or 20- 
beta-hydroxyl group of pregnane and 
androstane steroids can act as donors 




Steroid 11-beta- 
monooxygenase 


Also hydroxylates steroids at the 18- 
position, and converts 18- 
hydroxycorticosterone into 
aldosterone 




Corticosterone 18- 
monooxygenase 






Cholesterol 
monooxygenase 
(side-chain 
cleaving) 


The reaction proceeds in three stages, 
with hydroxylation at C-20 and C-22 
preceding scission of the side-chain at 
C-20 




Steroid 21- 
monooxygenase 






Progesterone 11- 
alpha- 

monooxygenase 
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Steroid 17-alpha- 
monooxygenase 






Cholestenone 5- 
beta-reductase 






Cortisone beta- 
reductase 






Progesterone 5- 
alpha-reductase 


Testosterone and 20-alpha-hydroxy-4- 
pregnen-3-one can act in place of 
progesterone 




3-oxo-5-beta- 
steroid 4- 
dehydrogenase 






Steroid delta- 
isomerase 




Flavonoids, 
stilbene and 
lignin 

biosynthesis 


Coniferyl-alcohol 
dehydrogenase 


Specific for coniferyl alcohol; does 
not act on cinnamyl alcohol, 4- 
coumaryl alcohol or sinapyl alcohol 




Cinnamyl-alcohol 
dehydrogenase 


Acts on coniferyl alcohol, sinapyl 
alcohol, 4-coumaryl alcohol and 
cinnamyl alcohol (cf. EC 1.1.1.194) 




Dihydrokaempferol 
4-reductase 


Also acts, in the reverse direction, on 
(+)-dihydroquercetin and (+)- 
dihydromyricetin Each 
dihydroflavonol is reduced to the 
corresponding cis-flavon- 3,4-diol 
NAD(+) can act instead of NADP(+), 
more slowly Involved in the 
biosynthesis of anthocyanidins in 
plants 




Flavonone 4- 
reductase 


Involved in the biosynthesis of 3- 
deoxyanthocyanidins from flavonones 
such as naringenin or eriodictyol 




Peroxidase 






Caffeate 3,4- 
dioxygenase 






Naringenin 3- 
dioxygenase 






Trans-cinnamate 4- 
monooxygendbe 


Also acts on NADH, more slowly 




Trans-cinnamate 2- 
monooxygenase 
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Flavonoid 

3[PRIME]- 

monooxygenase 


Acts on a number of flavonoids, 
including naringenin and 
dihydrokaempferol Does not act on 4- 
coumarate or 4-coumaroyl-CoA 




Monophenol 
monooxygenase 


A group of copper proteins that also 
catalyse the reaction of EC 1.10.3.1, if 
only 1,2-benzenediols are available as 
substrate 




Cinnamoyl-CoA 
reductase 


Also acts on a number of substituted 

1 A „ A 

cinnamoyl esters of coenzyme A 




Caffeoyl-CoA O- 
methyltransferase 






Luteolin O- 
methyltransferase 


Also acts on luteolin-7-O-beta-D- 
glucoside 




Caffeate 0- 
methyltransferase 


3,4-dihydroxybenzaldehyde and 
catechol can act as acceptor, more 
slowly 




Apigenin 

4[PRIME]-0- 

methyltransferase 


Converts apigenin into acacetin 
Naringenin (5, 7,4 [PRIME] - 
trihydroxyflavonone) can also act as 
acceptor, more slowly 




Ouercetin 3-0- 
methyltransferase 


Specific for quercetin. Related 
enzymes bring about the 3-0- 
methylation of other flavonols, such as 
galangin and kaempferol 




Isoflavone-7-O- 
beta-glucoside 
6[PRIME][PRIME] 
-0- 

malonyltransferase 


The 6-position of the glucose residue 
of formononetin can also act as 
acceptor Some other 7-0-glucosides 
of isoflavones, flavones and flavonols 
can also act, more slowly 




Pinosylvin synthase 


Not identical with EC 2.3.1.74 or EC 
2.3.1.95 




Naringenin- 
chalcone synthase 


In the presence of NADH and a 
reductase, 6[PRIME]-deoxychalcone 
is produced 




Trihydroxystilbene 
synthase 


Not identical with EC 2.3.1.74 or EC 
2.3.1.146 




Quinate 0- 

hydroxycinnamoylt 

ransferase 


Caffeoyl-CoA and 4-coumaroyl-CoA 
can also act as donors, more slowly 
Involved in the biosynthesis of 
chlorogenic acid in sweet potato and, 
with EC 2.3.1.98 in the formation of 
caffeoyl-CoA in tomato 




Coniferyl-alcohol 
glucosyltransferase 


Sinapyl alcohol can also act as 
acceptor 




2-coumarate 0- 
beta- 

glucosyltransferase 


Coumarinate (cis-2- 
hydroxycinnamate) does not act as 
acceptor 
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Scopoletin 
glucosyltransferase 






Flavonol-3-O- 
glucoside L- 
rhamnosyltransferas 
e 


Converts flavonol 3-O-glucosides to 
3-O-rutinosides Also acts, more 
slowly, on rutin, quercetin 3-0- 
galactoside and flavonol 03- 
rhamnosides 




Flavone 7-O-beta- 
glucosyltransferase 


A number of flavones, flavonones and 
flavonols can function as acceptors 
Different from EC 2.4.1.91 




Flavonol 3-0- 
glucosyltransferase 


Acts on a variety of flavonols, 
including quercetin and quercetin 7-0- 
glucoside Different from EC 2.4.1.81 




Flavone 

apiosyltransferase 


7-O-beta-D-glucosides of a number of 
flavonoids and of 4-substituted 
phenols can act as acceptors 




Coniferin beta- 
glucosidase 


Also hydrolyzes syringin, 4-cinnamyl 
alcohol beta-glucoside, and, more 
slowly, some other aryl beta- 
glycosides A plant cell-wall enzyme 
involved in the biosynthesis of lignin 




Beta-glucosidase 


Wide specificity for beta-D- 
glucosides. Some examples also 
hydrolyse one or more of the 
following: beta-D-galactosides, alpha- 
L- arabinosides, beta-D-xylosides, and 
beta-D-fucosides 




Chalcone isomerase 






4-coumarate— CoA 
ligase 
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Pathway Name 


Enzyme 
Description 


Enzyme Comments 


Ascorbate and aldarate 
metabolism 


D-threo-aldose 
1- 

dehydrogenase 


Acts on L-fucose, D- 
arabinose and L- 
xylose The animal 
enzyme was also 
shown to act on L- 
arabinose, and the 
enzyme from 
Pseudomonas 
caryophylli on L- 
glucose 




L-threonate 3- 
dehydrogenase 






Glucuronate 
reductase 


Also reduces D- 
galacturonate May 
be identical with EC 
1.1.1.2 




Glucuronolacton 
e reductase 






L-arabinose 1- 
dehydrogenase 






L- 

galactonolactone 
oxidase 


Acts on the 1 ,4- 
lactones of L- 
galactonic, D- 
altronic, L-fuconic f 
D-arabinic and D- 
threonic acids Not 
identical with EC 
1.1.3.8 (cf. EC 
1.3.2.3) 




L-gulonolactone 
oxidase 


The product 
spontaneously 
isomerizes to L- 
ascorbate 




L-ascorbate 
oxidase 






L-ascorbate 
peroxidase 






Ascorbate 2,3- 
dioxygenase 






2,5- 

aioxovaierate 
dehydrogenase 






Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
including oxidation 
of D- 
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glucuronolactone to 
D-glucarate 




Galactonolacton 
e 

dehydrogenase 


Cf. EC 1.1.3.24 




Monodehydroas 
corbate 
reductase 
(NADH) 






Glutathione 

dehydrogenase 

(ascorbate) 






L- 

arabinonolacton 
ase 






Gluconolactonas 
e 


Acts on a wide 
range of hexono- 
1,5-lactones 




Uronolactonase 






1,4-lactonase 


Specific for 1 ,4- 
lactones with 4-8 
carbon atoms Does 
not hydrolyse simple 
aliphatic esters, 
acetylcholine, sugar 
lactones or 
substituted aliphatic 
lactones, e.g. 3- 
hydroxy-4- 
butyrolactone 




2-dehydro-3- 

deoxyglucarate 

aldolase 






L-arabinonate 
dehydratase 






Glucarate 
dehydratase 






5-dehydro-4- 

deoxyglucarate 

dehydratase 






Galactarate 
dehydratase 






2-dehydro-3- 
deoxy-L- 
arabinonate 
dehydratase 




Carbon fixation 


Malate 

dehydrogenase 


Also oxidizes some 
other 2- 

hydroxydicarboxylic 
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acids 




Malate 

dehydrogenase 
(decarboxylating 

) 


Does not 
decarboxylates 
added oxaloacetate 




Malate. 

dehydrogenase 
(oxaloacetate 
decarboxylating) 
(NADP+) 


Also decarboxylates 
added oxaloacetate 




Malate 

dehydrogenase 
(NADP+) 


Activated by light 




Glyceraldehyde- 

3-phosphate 

dehydrogenase 

(NADP+) 

(phosphorylating 

) 






Transketolase 


Wide specificity for 
both reactants, e.g. 
converts 
hydroxy pyruvate 
and R-CHO into 
CO(2) and R- 
CHOH-CO- 
CH(2)OH 

Transketolase from 
Alcaligenes faecalis 
shows high activity 
with D-erythrose as 
acceptor 




Aspartate 

aminotransferas 

e 


Also acts on L- 
tyrosine, L- 
phenylalanine and 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




Alanine 

aminotransferas 
e 


2-aminobutanoate 
acts slowly instead 
of alanine 




Sedoheptulokina 
se 






Phosphoribuloki 
nase 






Pyruvate kinase 


UTP, GTP, CTP, 
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ITP and dATP can 
also act as donors 
Also phosphorylates 
hydroxylamine and 
fluoride in the 
presence of CO(2) 




Phosphoglycerat 
e kinase 






Pyruvate,phosph 
ate dikinase 






Fructose- 
bisphosphatase 


The animal enzyme 
also acts on 
sedoheptulose 1,7- 
bisphosphate 




Sedoheptuiose- 
bisphosphatase 






Phosphoenolpyr 
uvate 

carboxylase 






Ribulose- 

bisphosphate 

carboxylase 


Will utilize 0(2) 
instead of C0(2), 
forming 3-phospho- 
D-glycerate and 2- 
phosphoglycolate 




Phosphoenolpyr 
uvate 

carboxykinase 
(ATP) 






Fructose- 

bisphosphate 

aldolase 


Also acts on 
(3S,4R)-ketose di- 
phosphates The 
yeast and bacterial 
enzymes are zinc 
proteins The 
enzymes increase 
electron-attraction 
by the carbonyl 
group, some (Class 

I) forming a 
protonated imine 
with it, others (Class 

II) , mainly of 
microbial origin, 
polarizing it with a 
metal ion, e.g zinc 




Phosphoketolas 
e 






Ribulose- 
phosphate 3- 


Also converts D- 
erythrose 4- 
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epimerase 


phosphate into D- 
erythrulose 4- 
phosphate and D- 
threose 4-phosphate 




Triosephosphate 
isomerase 






Ribose 5- 

phosphate 

epimerase 


Also acts on D- 
ribose 5- 

diphosphate and D- 
ribose 5- 
triphosphate 


Phenylalanine metabolism 


(R)-4- 

hydroxyphenylla 
ctate 

dehydrogenase 


Also acts, more 
slowly, on (R)-3- 
phenyllactate, (R)-3- 
(indole-3- yl)lactate 
and (R)-lactate 




Hydroxyphenylp 

yruvate 

reductase 


Also acts on 3-(3,4- 
dihydroxyphenyl)lact 
ate Involved with EC 
2.3.1.140 in the 
biosynthesis of 
rosmarinic acid 




Aryl-alcohol 
dehydrogenase 


A group of enzymes 
with broad specificity 
towards primary 
alcohols with an 
aromatic or 
cyclohex-1-ene ring, 
but with low or no 
activity towards 
short-chain aliphatic 
alcohols 




Peroxidase 






Catechol 1 ,2- 
dioxygenase 


Involved in the 
metabolism of nitro- 
aromatic 
compounds by a 
strain of 
Pseudomonas 
putida 




2,3- 

d [hydroxy be nzoa 
te 3,4- 
dioxygenase 






3- 

carboxyethylcate 
chol 2,3- 
dioxygenase 






Catechol 2,3- 


The enzyme from 
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dioxygenase 


Alcaligines sp. strain 
0-1 has also been 
shown to catalyse 
the reaction: 3- 
Sulfocatechol + 0(2) 
+ H(2)0 = 2- 
hydroxymuconate + 
bisulfite. It has been 
referred to as 3- 
sulfocatechol 2,3- 
dioxygenase. 
Further work will be 
necessary to show 
whether or not this is 
a distinct enzyme 




4- 

hydroxyphenylpy 
ruvate 

dioxygenase 






Protocatechuate 
3,4-dioxygenase 






Hydroxyquinol 
1,2-dioxygenase 


The product 
isomerizes to 2- 
maleylacetate (cis- 
hex-2-enedioate) 
Highly specific; 
catechol and 
pyrogallol are acted 
on at less than 1% 
of the rate at which 
benzene-1 ,2,4-thol 
is oxidized 




Protocatechuate 
4,5-dioxygenase 






Phenylalanine 2- 
monooxygenase 


Also catalyses a 
reaction similar to 
that of EC 1.4.3.2, 
forming 3- 
phenylpyruvate, 
NH(3) and H(2)0(2), 
but more slowly 




Anthranilate 1 2- 
dioxygenase 
(deaminating, 
decarboxylating) 






Benzoate 1 ,2- 
dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
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and an iron-sulfur 
oxygenase 




Toluene 
dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
an iron-sulfur 
oxygenase, and a 
ferredoxin Some 
other aromatic 
compounds, 
including 
ethylbenzene, 4- 
xylene and some 
halogenated 
toluenes, are 
converted into the 
corresponding cis- 
dihydrodiols 




Naphthalene 
1,2-dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein (FAD), 
an iron-sulfur 
oxygenase, and 
ferredoxin 




Benzene 1,2- 
dioxygenase 


A system, containing 
a reductase which is 
an iron-sulfur 
flavoprotein, an iron- 
sulfur oxygenase 
and ferredoxin 




Salicylate 1- 
monooxygenase 






Trans-cinnamate 
4- 

monooxygenase 


Also acts on NADH, 
more slowly 




Benzoate 4- 
monooxygenase 






4- 

hydroxybenzoat 
e3- 

monooxygenase 


Most enzymes from 
Pseudomonas are 
highly specific for 
NAD(P)H (cf EC 
1.14.13.33) 




3- 

hydroxybenzoat 
e4- 

monooxygenase 


Also acts on a 
number of analogs 
of 3- 

hydroxybenzoate 
substituted in the 2, 
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4, 5 and 6 positions 




3- 

hydroxybenzoat 
e6- 

monooxygenase 


Also acts on a 
number of analogs 
of 3- 

hydroxybenzoate 
substituted in the 2, 
4, 5 and 6 positions 
NADPH can act 
instead of NADH, 
more slowly 




4- 

hydroxybenzoat 
e3- 

monooxygenase 
(NAD(P)H) 


The enzyme from 
Corynebacterium 
cyclohexanicum is 
highly specific for 4- 
hydroxybenzoate, 
but uses NADH and 
NADPH at 
approximately equal 
rates (cf. EC 
1.14.13.2). It is less 
specific for NADPH 
than EC 1.14.13.2 




Anthranilate 3- 

monooxygenase 

(deaminating) 


The enzyme from 
Aspergillus niger is 
an iron protein; that 
from the yeast 
Trichosporon 
cutaneum is a 
flavoprotein (FAD) 




Melilotate 3- 
monooxygenase 






Phenol 2- 
monooxygenase 


Also active with 
resorcinol and O- 
cresol 




Mandelate 4- 
monooxygenase 






3- 

hydroxybenzoat 
e2- 

monooxygenase 






4-cresol 

dehydrogenase 

(hydroxylating) 


Phenazine 
methosulfate can act 
as acceptor A 
quinone methide is 
probably formed as 
intermediate The 
product is oxidized 
further to 4- 
hydroxybenzoate 
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Benzaldehyde 

dehydrogenase 

(NAD+) 






Aminomuconate- 

semialdehyde 

dehydrogenase 


Also acts on 2- 

hydroxymuconate 

semialdehyde 




Phenylacetaldeh 
yde 

dehydrogenase 






4-carboxy-2- 

hydroxymuconat 

e-6- 

semialdehyde 
dehydrogenase 


Does not act on 
unsubstituted 
aliphatic or aromatic 
aldehydes or 
glucose NAD(+) can 
replace NADP(+), • 
but with lower 
affinity 




Aldehyde 

dehydrogenase 

(NAD(P)+) 






Benzaldehyde 

dehydrogenase 

(NADP+) 






Coumarate 
reductase 






Cis-1,2- 

dihydrobenzene- 
1,2-diol 

dehydrogenase 






Cis-1,2-dihydro- 
1,2- 

dihydroxynaphth 
alene 

dehydrogenase 


Also acts, at half the 
rate, on cis- 
anthracene 
dihydrodiol and cis- 
phenanthrene 
dihydrodiol 




2-enoate 
reductase 


Acts, in the reverse 
direction, on a wide 
range of alkyl and 
aryl alpha, beta- 
unsatu rated 
carboxylate ions 2- 
butenoate was the 
best substrate 
tested 




Maleylacetate 
reductase 






Phenylalanine 
dehydrogenase 


The enzyme from 
Bacillus badius and 
Sporosarcina ureae 
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are highly specific 
for L-phenylalanine, 
that from Bacillus 
sphaericus also acts 
on L-tyrosine 




L-amino acid 
oxidase 






AminG oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 
(copper- 
containing) 


A group of enzymes 
including those 
oxidizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalyses this 
reaction 




D-amino-acid 
dehydrogenase 


Acts to some extent 
on all D-amino acids 
except D-aspartate 
and D-glutamate 




Aralkylamine 
dehydrogenase 


Phenazine 
methosulfate can act 
as acceptor Acts on 
aromatic amines 
and, more slowly, on 
some long-chain 
aliphatic amines, but 
not on methylamine 
or ethylamine (cf EC 
1.4.99.3) 




Glutamine N- 

phenylacetyitran 

sferase 






Acetyl-CoA C- 
acyltransferase 






D-amino-acid N- 
acetyltransferas 
e 






Phenylalanine 
N- 

acetyltransferas 
e 


Also acts, more 
slowly, on L-histidine 
and L-alanine 




Glycine N- 

benzoyltransfera 

se 


Not identical with EC 
2.3.1.13 or EC 
2.3.1.68 
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Aspartate 

aminotransferas 

e 


Also acts on L- 
tyrosine, L- 
phenylalanine and 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




D-alanine 

aminotransferas 

e 


Acts on the D- 

isomers of leucine, 

aspartate, 

glutamate, 

aminobutyrate, 

norvaline and 

asparagine 




Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 




Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 
more slowly 
Oxaloacetate can 
act as acceptor 
Controlled 
proteolysis converts 
the enzyme to EC 
2.6.1.1 




Histidinol- 
phosphate 
aminotransferas 
e 






3-oxoadipate 
CoA-transferase 






3-oxoadipate 
enol-lactonase 


Acts on the product 
of EC 4.1.1.44 




Carboxymethyle 
nebutenolidase 






2-pyrone-4,6- 
dicarboxylate 
lactonase 


The product 
isomerizes to 4- 
oxalmesaconate 




Hippurate 


Acts on various N- 
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hydrolase 


benzoylamino acids 




Amidase 






Acylphosphatas 
e 






2- 

hydroxymuconat 

e-semialdehyde 

hydrolase 






Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
di hydroxy- L- 
phenylalanine 
(DOPA) 




Phenylpyruvate 
decarboxylase 


Also acts on indole- 
s-pyruvate 




4- 

carboxymuconol 
actone 

decarboxylase 






O- 

pyrocatechuate 
decarboxylase 






Phenylalanine 
decarboxylase 


Also acts on tyrosine 
and other aromatic 
amino acids 




4- 

hydroxybenzoat 
e decarboxylase 






Protocatechuate 
decarboxylase 






Benzoylformate 
decarboxylase 






4-oxalocrotonate 
decarboxylase 


Involved in the 
meta-cleavage 
pathway for the 
degradation of 
phenols, cresols and 
catechols 




4-hydroxy-4- 
methyl-2- 
oxoglutarate 
aldolase 


Also acts on 4- 
hydroxy-4-methyl-2- 
oxoadipate and 4- 
carboxy-4-hydroxy- 
2-oxohexadioate 




2-oxopent-4- 

enoate 

hydratase 


Also acts, more 
slowly, on cis-2- 
oxohex-4-enoate, 
but not on the trans- 
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isomer 




Phenylalanine 
ammonia-lyase 


May also act on L- 
tyrosine 




Phenylalanine 
racemase (ATP- 
hydrolysing) 






Mandelate 
racemase 






Phenylpyruvate 
tautomerase 


Also acts on other 
aryl pyruvates 




5- 

carboxymethyl- 
2- 

hydroxymuconat 
e delta- 
isomerase 






Muconolactone 
delta-isomerase 






Muconate 
cycloisomerase 


Also acts, in the 
reverse reaction, on 
3-methyl-cis-cis- 
hexa- dienedioate 
and, very slowly, on 
cis-trans- 

hexadienedioate Not 
identical with EC 
5.5.1.7 or EC 
5.5.1.11 




3-carboxy- 

cis,cis-muconate 

cycloisomerase 






Carboxy-cis,cis- 

muconate 

cyclase 






Chloromuconate 
cycloisomerase 


Spontaneous 
elimination of HCI 
produces cis-4- 
carboxymethylenebu 
t-2- en-4-olide Also 
acts in reverse 
direction on 2- 
chloro-cis,cis- 
muconate Not 
identical with EC 
5.5.1.1 or EC 
5.5.1.11 




Phenylacetate-- 
CoA ligase 


Phenoxyacetate can 

replace 

phenylacetate 
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Benzoate-CoA 
ligase 


Also acts on 2-, 3- 
and 4- 

fluorobenzoate, but 
only very slowly on 
the corresponding 
chlorobenzoates 




4- 

hydroxybenzoat 
e--CoA ligase 






Phenylacetate- 
CoA ligase 


Also acts, more 
slowly, on acetate, 
propanoate and 
butanoate, but not 
on hydroxy 
derivatives of 
phenylacetate and 
related compounds 


Phenylalanine, tyrosine and 
tryptophan biosynthesis 


Quinate 5- 
dehydrogenase 






Shikimate 5- 
dehydrogenase 






Quinate 
dehydrogenase 
(pyrroloquinoline 
-quinone) 






Phenylalanine 4- 
monooxygenase 






Prephenate 
dehydrogenase 


This enzyme in the 
enteric bacteria also 
possesses 
chorismate mutase 
activity (EC 
5.4.99.5) and 
converts chorismate 
into prephenate 




Prephenate 

dehydrogenase 

(NADP+) 






Cyclohexadienyl 
dehydrogenase 


Also acts on 
prephenate and D- 
prephenyllactate (cf. 
EC 1.3.1.12) 




2-methyl- 
branched-chain- 
enoyl-CoA 
reductase 


From Ascaris suum 
The reaction 
proceeds only in the 
presence of another 
flavoprotein 
(ETF=[PRIME]Electr 
on-Transferring 
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Flavoprotein[PRIME] 
) 




Phenylalanine 
dehydrogenase 


The enzyme from 
Bacillus badius and 
Sporosarcina ureae 
are highly specific 
for L-phenylalanine, 
that from Bacillus 
sphaericus also acts 
on L-tyrosine 




L-amino acid 
oxidase 






Anthranilate 

phosphoribosyltr 

ansferase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
4.1.1.48, EC 
4.1.3.27, EC 
4.2.1.20, and EC 
5.3.1.24) 




3- 

phosphoshikimat 
e 1- 

carboxyvinyltran 
sferase 






Aspartate 

aminotransferas 

e 


Also acts on L- 
tyrosine, L- 
phenylalanine and 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 
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Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 
more slowly 
Oxaloacetate can 
act as acceptor 
Controlled 
proteolysis converts 
the enzyme to EC 
2.6.1.1 




Histidinol- 

1 II V 1 \Jt III 1 

phosphate 

aminotransferas 

e 






Shikimate kinase 






lndole-3- 
glycerol- 
phosphate 
synthase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.3.27, EC 
4.2.1.20, and EC 
5.3.1.24) 




2-dehvdro-3- 
deoxyphosphoh 
eptonate 
aldolase 






Anthranilate 
synthase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.1.48, EC 
4.2.1.20, and EC 
5.3.1.24) The native 
enzyme in the 
complex with uses 
either glutamine or 
(less efficiently) 
NH(3). The enzyme 
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separated from the 
complex uses NH(3) 
only 




3- 

dehydroquinate 
dehydratase 






Phosphopyruvat 
e hydratase 


Also acts on 3- 

phospho-D- 

erythronate 




Tryptophan 
synthase 


Also catalyses the 
conversion of serine 
and indole into 
tryptophan and 
water and of 
indoleglycerol 
phosphate into 
indole and 
glyceraldehyde 
phosphate In some 
organisms, this 
enzyme is part of a 
multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.1.48, EC 
4.1.3.27, and EC 
5.3.1.24) 




Prephenate 
dehydratase 


This enzyme in the 
enteric bacteria also 
possesses 
chorismate mutase 
activity and converts 
chorismate into 
prephenate 




Carboxycyclohe 

xadienyl 

dehydratase 


Also acts on 
prephenate and D- 
prephenyllactate Cf. 
EC 4.2.1.51 




3- 

dehydroquinate 
synthase 


The hydrogen atoms 
on C-7 of the 
substrate are 
retained on C-2 of 
the products 




Chorismate 


Shikimate is 
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synthase 


numbered so that 
the double-bond is 
between C-1 and C- 
2, but some earlier 
papers numbered in 
the reverse direction 




Phosphoribosyla 

nthranilate 

isomerase 


In some organisms, 
this enzyme is part 
of a multifunctional 
protein together with 
one or more 
components of the 
system for 
biosynthesis of 
tryptophan (EC 
2.4.2.18, EC 
4.1.1.48, EC 
4.1.3.27, and EC 
4.2.1.20) 




Chorismate 
mutase 






Tyrosine-tRNA 
ligase 






Phenylalanine- 
tRNA ligase 




Starch and sucrose 
metabolism 


UDP-glucose 6- 
dehydrogenase 


Also acts on UDP-2- 
deoxyglucose 


* 


Glucoside 3- 
dehydrogenase 


The enzyme acts on 
D-glucose, D- 
galactose, D- 
glucosides and D- 
qalactosides, but D- 
glucosides react 
more rapidly than D- 
galactosides 




CDP-4-dehydro- 

6-deoxyglucose 

reductase 


Two proteins are 
involved but no 
partial reaction has 
been observed in 
the presence of 
either alone 




Phosphorylase 


The recommended 
name should be 
qualified in each 
instance by adding 
the name of the 
natural substance, 
e.g. maltodextrin 
phosphorylase, 
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starch 

phosphorylase, 

glycogen 

phosphorylase 




Levansucrase 


Some other sugars 
can act as D- 
fructosyl acceptors 




Glycogen 

(starch) 

synthase 


The recommended 
name varies 
according to the 
source of the 
enzyme and the 
nature of its 
synthetic product 
Glycogen synthase 
from animal tissues 
is a complex of a 
catalytic subunit and 
the protein 
glycogenin The 
enzyme requires 
glucosylated 
glycogenin as a 
primer; this is the 
reaction product of 
EC 2.4.1. 186 A 
similar enzyme 
utilizes ADP-glucose 
(Cf. EC 2.4.1.21) 




Cellulose 
synthase (UDP- 
forming) 


Involved in the 
synthesis of 
cellulose A similar 
enzyme utilizes 
GDP-glucose (Cf. 
EC 2.4.1.29) 




Sucrose 
synthase 






Sucrose- 
phosphate 
synthase 






Alpha.alpha- 
trehalose- 
phosphate 
synthase (UDP- 
forming) 


See also EC 
2.4.1.36 




UDP- 

glucuronosyltran 
sferase 


Family of enzymes 
accepting a wide 
range of substrates, 
including phenols, 




Page 602 of 772 



Attorney Ukt. 
£^ Client Dkt. 80090.005 







alcohols, amines 
and fatty acids 
Some of the 
activities catalysed 
were previously 
listed separately as 
EC 2.4.1.42, EC 
2.4.1.59, EC 
2.4.1.61, EC 

2.4.1.76, EC 

2.4.1.77, EC 
2.4.1.84, EC 

2.4.1.107 and EC 

2.4.1.108 A 
temporary 

nomenclature for the 
various forms whose 
delineation is in a 
state of flux 




1,4-alpha-glucan 

branching 

enzyme 


Converts amylose 
into amylopectin The 
recommended name 
requires a 
qualification 
depending on the 
product, glycogen or 
amylopectin, e.g. 
glycogen branching 
enzyme, 
amylopectin 
branching enzyme. 
The latter has 
frequently been 
termed Q-enzyme 




Cellobiose 
phosphorylase 






Starch (bacterial 

glycogen) 

synthase 


The recommended 
name various 
according to the 
source of the 
enzyme and the 
nature of its 
synthetic product, 
e.g. starch synthase, 
bacterial glycogen 
synthase A similar 
enzyme utilizes 
UDP-glucose (Cf. 
EC 2.4.1.11) 
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4-alpha- 

glucanotransfera 
se 


An enzymic activity 
of this nature forms 
part of the 
mammalian and 
Yeast glycogen 
branching system 
(see EC 3.2.1.33) 




Cellulose 
synthase (GDP- 
forming) 


Involved in the 
synthesis of 
cellulose A similar 
enzyme utilizes 
UDP-glucose (Cf. 
EC 2.4.1.12) 




1,3-beta-glucan 
synthase 






Phenol beta- 

glucosyltransfera 

se 


Acts on a wide 
range of phenols 




Amylosucrase 






Polyqalacturonat 
e 4-alpha- 
galacturonosyltr 
ansferase 






Dextransucrase 






Alpha.alpha- 

trehalose 

phosphorylase 






Sucrose 
phosphorylase 


In the forward 
reaction, arsenate 
may replace 
phosphate In the 
reverse reaction 
various ketoses and 
L-arabinose may 
replace D-fructose 




Maltose 
phosphorylase 






1,4-beta-D-xylan 
synthase 






Hexokinase 


D-glucose, D- 
mannose, D- 
fructose, sorbitol 
and D-glucosamine 
can act as acceptors 
ITP and dATP can 
act as donors The 
liver isoenzyme has 
sometimes been 
called glucokinase 
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Phosphoglucoki 
nase 






Glucose-1,6- 
bisphosphate 
synthase 


D-glucose 6- 
phosphate can act 
as acceptor, forming 
D-glucose 1,6- 
bisphosphate 




Glucokinase 


A group of enzymes 
found in 

invertebrates and 
microorganisms 
highly specific for 
glucose 




Fructokinase 






Glucose-1- 
phosphate 
phosphodismuta 
se 






Protein-N(PI)- 

phosphohistidine 

-sugar 

phosphotransfer 
ase 


Comprises a group 
of related enzymes 
The protein 
substrate is a 
phosphocarrier 
protein of low 
molecular mass (9.5 
Kd) A 

phosphoenzyme 
intermediate is 
formed The enzyme 
translocates the 
sugar it 

phosphorylates into 
bacteria 

Aldohexoses and 
their glycosides and 
alditols are 
phosphorylated on 
0-6; fructose and 
sorbose on 0-1 
Glycerol and 
disaccharides are 
also substrates 




Glucose-1- 
phosphate 
adenylyltransfer 
ase 






Glucose-1- 
phosphate 
cytidylyltransfera 
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se 






Glucose-1- 
phosphate 
guanylyltransfer 
ase 


Also acts, more 
slowly, on D- 
mannose di- 
phosphate 




UTP--glucose-1- 
phosphate 
uridylyltransferas 
e 






Pectinesterase 






Trehalose- 
phosphatase 






Sucrose- 
phosphatase 






Glucose-6- 
phosphatase 


Wide distribution in 
animal tissues Also 
catalyses potent 
transphosphorylatio 
ns from carbamoyl 
phosphate, hexose 
phosphates, 
pyrophosphate, 
phosphoenolpyruvat 
e and nucleoside di- 
and triphosphates, 
to D-glucose, D- 
mannose, 3-methyl- 
D-glucose, or 2- 
deoxy-D-glucose (cf. 
EC 2.7.1.62, EC 
2.7.1.79, and EC 
3.9.1.1) 




Alpha-amylase 


Acts on starch, 
glycogen and 
related 

polysaccharides and 
oligosaccharides in 
a random manner; 
reducing groups are 
liberated in the 
alpha-configuration 




Oligo-1,6- 
glucosidase 


Also hydrolyses 
palatinose The 
enzyme from 
intestinal mucosa is 
a single polypeptide 
chain also catalysing 
the reaction of EC 
3.2.1.48 
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Maltose- 
6[PRIME]- 
phosphate 
glucosidase 


Hydrolyses a variety 
of 6-phospho-D- 
glucosides, including 
maltose 6- 
phosphate, 
alpha[PRIME]alpha- 
trehalose 6- 
phosphate, sucrose 
6-phosphate and p- 
nitrophenyl-alpha-D- 
glucopyranoside 6- 
phosphate (as a 
chromogenic 
substrate) The 
enzyme is activated 
by Fe(ll), Mn(ll), 
Co(ll)and Ni(ll). It is 
rapidly inactivated in 
air 




Polygalacturona 
se 






Beta-amylase 


Acts on starch, 
glycogen and 
related 

polysaccharides and 
oligosaccharides 
producing beta- 
maltose by an 
inversion 




Alpha- 
glucosidase 


Group of enzymes 
whose specificity is 
directed mainly 
towards the 
exohydrolysis of 1,4- 
alpha-glucosidic 
linkages, and that 
hydrolyse 
oligosaccharides 
rapidly, relative to 
polysaccharides, 
which are 

hydrolysed relatively 
slowly, or not at all 
The intestinal 
enzyme also 
hydrolyses 
polysaccharides, 
catalysing the 
reactions of EC 
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3.2.1.3, and, more 
slowly, hydrolyses 
1,6-alpha- D- 
glucose links 




Beta- 

glucosidase 


Wide specificity for 
beta-D-glucosides. 
Some examples 
also hydrolyse one 
or more of the 
following: beta-D- 
galactosides, alpha- 
L- arabinosides, 
beta-D-xylosides, 
and beta-D- 
fucosides 




Beta- 

fructofuranosida 
se 


Substrates include 
sucrose Also 
catalyses 
fructotransferase 
reactions 




Alpha,alpha- 
trehalase 






Glucan 1,4- 

alpha- 

glucosidase 


Most forms of the 
enzyme can rapidly 
hydrolyse 1,6-alpha- 
D-glucosidic bonds 
when the next bond 
in sequence is 1,4, 
and some 
preparations of this 
enzyme hydrolyse 
1,6- and 1,3-alpha- 
D-glucosidic bonds 
in other 

polysaccharides 
This entry covers all 
such enzymes 
acting on 
polysaccharides 
more rapidly than on 
oligosaccharides EC 
3.2.1.20 from 
mammalian intestine 
can catalyse similar 
reactions 




Beta- 

glucuronidase 






Amylo-1,6- 
glucosidase 


In mammals and 
yeast this enzyme is 
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linked to a 
glycosyltransferase 
similar to EC 
2.4.1.25; together 
these two activities 
constitute the 
glycogen 

debranching system 




Xylan 1,4-beta- 
xylosidase 


Also hydrolyses 
xylobiose Some 
other 

exoglycosidase 
activities have been 
found associated 
with this enzyme in 
sheep liver 




Glucan endo- 

1,3-beta-D- 

glucosidase 


Very limited action 
on mixed-link (1,3- 
1 ,4-)-beta-D-glucans 
Hydrolyses 
laminarin, 
paramylon and 
pachyman Different 
from EC 3.2.1.6 




Cellulase 


Will also hydrolyse 
1 ,4-linkages in beta- 
D-glucans also 
containing 1,3- 
linkages 




Sucrose alpha- 
glucosidase 


This enzyme is 
isolated from 
intestinal mucosa as 
a single polypeptide 
chain also displaying 
activity towards 
isomaltose (oligo- 
1 ,6-glucosidase, cf. 
EC 3.2.1.10) 




Cyclomaltodextri 
nase 


Also hydrolyses 
linear maltodextrin 




Glucan 1,3-beta- 
glucosidase 


Acts on 

oligosaccharides but 
very slowly on 
laminaribiose 




Levanase 






Galacturan 1,4- 
alpha- 

galacturonidase 






Glucan 1 ,4-beta- 


Acts on 1 ,4-beta-D- 
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glucosidase 


glucans and related 
oliaosaccharides 
Cellobiose is 
hydrolysed, very 
slowly 






Cellulose 1,4- 
beta- 

cellobiosidase 








Alpha,alpha- 

phosphotrehalas 

e 








ADP-sugar 
diphosphatase 


Has a distinct 
specificity from the 
UDP-sugar 
pyrophosphatase 
(EC 3.6.1.45) 






Nucleotide 

pyrophosphatas 

e 


Substrates include 
NAD(+), NADP(+), 
FAD, CoA and also 
ATP and ADP 






UDP- 

glucuronate 
decarboxylase 




fU 

s ; : 




CDP-glucose 
4,6-dehydratase 








CDP-abequose 
epimerase 




3 tr 




UDP- 

glucuronate 4- 
epimerase 




r : 

jss:. 




GIucose-6- 
phosphate 
isomerase 


Also catalyses the 
anomerization of D- 
glucose 6- 
phosphate 






Phosphoglucom 
utase 


Maximum activity is 
only obtained in the 
presence of alpha- 
D-giucose 1 ,6- 
bisphosphate. This 
bisphosphate is an 
intermediate in the 
reaction, being 
formed by transfer of 
a phosphate residue 
from the enzyme to 
the substrate, but 
the dissociation of 
bisphosphate from 
the enzyme complex 
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is much slower than 
the overall 
isomerization Also, 
more slowly, 
catalyses the 
interconversion of 1- 
phosphate and 6- 
phosphate isomers 
of many other alpha- 
D-hexoses, and the 
interconversion of 
alpha-D-ribose 1- 
phosphate and 5- 
phosphate 




Beta- 

phosphoglucom 
utase 






Maltose alpha- 
D- 

glucosyltransfera 
se 




Tryptophan metabolism 


lndole-3-lactate 
dehydrogenase 






lndole-3- 
acetaldehyde 
reductase 
(NADH) 






lndole-3- 
acetaldehyde 
reductase 
(NADPH) 






3-hydroxyacyl- 
CoA 

dehydrogenase 


Also oxidizes S-3- 

hydroxyacyl-N- 

acylthioethanolamin 

e and S-3- 

hyd roxya cy I hyd ro I i p 

oate Some enzymes 

act, more slowly, 

with NADP(+) Broad 

specificity to acyl 

chain-length (cf. EC 

1.1.1.211) 




O-aminophenol 
oxidase 


Isophenoxazine may 
be formed by a 
secondary 
condensation from 
the initial oxidation 
product 




Catalase 


This enzyme can 
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- 


also act as a 
peroxidase (EC 
1.1 1.1.7) for which 
several organic 
substances, 
especially ethanol, 
can act as a 
hydrogen donor A 
manganese protein 
containing Mn(lll) in 
the resting state, 
which also belongs 
here, is often called 
pseudocatalase 
Enzymes from some 
microorganisms, 
such as Penicillium 
simplicissimum, 
which exhibit both 
catalase and 
peroxidase activity, 
have sometimes 
been referred to as 
catalase-peroxidase 




7,8- 

dihydroxykynure 
nate 8,8A- 
dioxygenase 






Tryptophan 2,3- 
dioxygenase 


Broad specificity 
towards tryptamine 
and derivatives 
including D- and L- 
tryptophan, 5- 
hydroxytryptophan 
and serotonin 




Indole 2,3- 
dioxygenase 


The enzyme from 
jasminum is a 
flavoprotein 
containing copper, 
and forms 
anthranilate as the 
final product One 
enzyme from 
Tecoma stans is 
also a flavoprotein 
containing copper 
and uses three 
atoms of oxygen per 
molecule of indole, 
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to form anthranil 
(3,4-benzisoxazole) 
A second enzyme 
from Tecoma stans, 
which is not a 
flavoprotein, uses 
four atoms of 
oxygen and forms 
anthranilate as the 
final product 




2,3- 

dihydroxyindole 
2,3-dioxygenase 




• 


Indoleamine- 
pyrrole 2,3- 
dioxygenase 


Acts on many 
substituted and 
unsubstituted 
indoleamines, 
including melatonin 
Involved in the 
degradation of 
melatonin 




3- 

hydroxyanthranil 
ate 3,4- 
dioxygenase 


The product of the 
reaction 
spontaneously 
rearrange to 
quinolinic acid (quin) 




Tryptophan 2- 
monooxygenase 






Tryptophan 

2[PRIME]- 

dioxygenase 


Acts on a number of 
indoIyl-3-alkane 
derivatives, oxidizing 
the 3-side-chain in 
the 2[PRIME]- 
position. Best 
substrates are L- 
tryptophan and 5- 
hydroxy-L- 
tryptophan 




Kynurenine 3- 
monooxygenase 






Unspecific 
monooxygenase 


Acts on a wide 
range of substrates 
including many 
xenobiotics, 
steroids, fatty acids, 
vitamins and 
prostaglandins 
Reactions catalysed 
include 
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hydroxylation, 
epoxidation, N- 
oxidation, 
sulfooxidation, N-, 
S- and O- 
dealkylations, 
desulfation, 
deamination, and 
reduction of azo, 
nitro, and N-oxide 
groups 




Anthranilate 3- 
monooxygenase 






Tryptophan 5- 
monooxygenase 


Activated by 
phosphorylation, 
catalysed by a 
CA(2+)-activated 
protein kinase 




Kynurenine 7,8- 
hydroxylase 






Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
including oxidation 
of D- 

glucuronolactone to 
D-glucarate 




Aminomuconate- 

semialdehyde 

dehydrogenase 


Also acts on 2- 

hydroxymuconate 

semialdehyde 




Aldehyde 
oxidase 


Also oxidizes 
quinoline and 
pyridine derivatives 
May be identical 
with EC 1.1.3.22 




lndole-3- 

acetaldehyde 

oxidase 


Also oxidizes indole- 
3-aldehyde and 
acetaldehyde, more 
slowly 




Oxoglutarate 

dehydrogenase 

(lipoamide) 


Component of the 
multienzyme 2- 
oxoglutarate 
dehydrogenase 
complex 




Kynurenate-7,8- 

dihydrodiol 

dehydrogenase 






Glutaryl-CoA 
dehydrogenase 






L-amino acid 
oxidase 
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Amine oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 
(copper- 
containing) 


A group of enzymes 
including those 
oxidizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalyses this 
reaction 




Acetyl indoxyl 
oxidase 






Acetylserotonin 
O- 

methyltransferas 
e 


Some other 
hydroxyindoles also 
act as acceptor, 
more slowly 




lndole-3- 
pyruvate C- 
methyltransferas 
e 






Amine N- 

methyltransferas 

e 


A wide range of 
primary, secondary, 
and tertiary amines 
can act as 
acceptors, including 
tryptamine, aniline, 
nicotine and a 
variety of drugs and 
other xenobiotics 




Aralkylamine N- 
acetyltransferas 
e 


Narrow specificity 
towards 
aralkylamines, 
including serotonin 
Not identical with EC 
2.3.1.5 




Acetyl-CoA C- 

acetyltransferas 

e 






Tryptophan 

aminotransferas 

e 


Also acts on 5- 
hydroxytryptophan 
and, to a lesser 
extent on the phenyl 
amino acids 




Kynurenine- 
oxoglutarate 
aminotransferas 


Also acts on 3- 
hydroxykynurenine 
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e 






Thioglucosidase 


Has a wide 
specificity for 
thioglycosides 




Amidase 






Formamidase 


Also acts, more 
slowly, on 
acetamide, 
propanamide and 
butanamide 




Arylformamidase 


Also acts on other 

aromatic 

formylamines 




Nitrilase 


Acts on a wide 
range of aromatic 
nitriles including 
(indole-3- yl)- 
acetonitrile and also 
on some aliphatic 
nitriles, and on the 
corresponding acid 
amides (cf. EC 
4.2.1.84) 




Kynureninase 


Also acts on 

3[PRIME]- 

hydroxykynurenine 

and some other (3- 

arylcarbonyl)- 

alanines 




Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
dihydroxy- L- 
phenylalanine 
(DOPA) 




Phenylpyruvate 
decarboxylase 


Also acts on indole- 
s-pyruvate 




Aminocarboxym 
uconate- 
semialdehyde 
decarboxylase 


The product 
rearranges non- 
enzymically to 
picolinate 




Tryptophanase 


Also catalyses the 
synthesis of 
tryptophan from 
indole and serine 
Also catalyses 2,3- 
elimination and 
beta-replacement 
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reactions of some 
indole-substituted 
tryptophan analogs 
of L-cysteine, L- 
serine and other 3- 
substituted amino 
acids 




Enoyl-CoA 
hydratase 


Acts in the reverse 
direction With cis- 
compounds, yields 
(3R)-3-hydroxyacyl- 
CoA (cf. EC 
4.2.1.74) 




Nitrile hydratase 


Acts on short-chain 
aliphatic nitriles, 
converting them into 
the corresponding 
acid amides Does 
not act on these 
amides or on 
aromatic nitriles (cf 
EC 3.5.5.1) 




Tryptophan- 
tRNA ligase 




Tyrosine metabolism 


Alcohol 

dehydrogenase 


Acts on primary or 
secondary alcohols 
or hemiacetals The 
animal, but not the 
yeast, enzyme acts 
also on cyclic 
secondary alcohols 




(R)-4- 

hydroxyphenylla 
ctate 

dehydrogenase 


Also acts, more 
slowly, on (R)-3- 
phenyllactate, (R)-3- 
(indole-3- yl)lactate 
and (R)-lactate 




Hydroxyphenylp 

yruvate 

reductase 


Also acts on 3-(3,4- 
dihydroxyphenyl)lact 
ate Involved with EC 
2.3.1.140 in the 
biosynthesis of 
rosmarinic acid 




Aryl-alcohol 
dehydrogenase 


A group of enzymes 
with broad specificity 
towards primary 
alcohols with an 
aromatic or 
cyclohex-1-ene ring, 
but with low or no 
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activity towards 
short-chain aliphatic 
alcohols 




Catechol 
oxidase 


Also acts on a 
variety of substituted 
catechols Many of 
these enzymes also 
catalyse the reaction 
listed under EC 
1.14.18.1; this is 
especially true for 
the classical 
tyrosinase 




Iodide 
peroxidase 






3,4- 

dihydroxyphenyl 
acetate 2,3- 
dioxygenase 






4- 

hydroxyphenylpy 
ruvate 

dioxygenase 






Stizolobate 
synthase 


The intermediate 
product undergoes 
ring closure and 
oxidation, with 
NAD(P)(+) as 
acceptor, to 
stizolobic acid 




Stizolobinate 
synthase 


The intermediate 
product undergoes 
ring closure and 
oxidation, with 
NAD(P)(+) as 
acceptor, to 
stizolobinic acid 




Gentisate 1,2- 
dioxygenase 






Homogentisate 
1,2-dioxygenase 






4- 

hydroxyphenylac 
etate 1- 

monooxygenase 


Also acts on 4- 
hyd roxy hyd ratropate 
forming 2- 
methylhomogentisat 
e and on 4- 
hydroxyphenoxyacet 
ate forming 
hydroquinone and 
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glycolate 




4- 

hydroxyphenylac 
etate 3- 

monooxygenase 


1 




Tyrosine N- 
monooxygenase 






Hydroxyphenyla 
cetonitrile 2- 
monooxygenase 






Tyrosine 3- 
monooxygenase 


Activated by 
phosphorylation, 
catalysed by EC 
2.7.1.128 




Dopamine-beta- 
monooxygenase 


Stimulated by 
fu ma rate 




Monophenol 
monooxygenase 


A group of copper 
proteins that also 
catalyse the reaction 
of EC 1.10.3.1, if 
only 1,2- 
benzenediols are 
available as 
substrate 




Succinate- 
semialdehyde 
dehydrogenase 
(NAD(P)+) 






Aryl-aldehyde 
dehydrogenase 


Oxidizes a number 
of aromatic 
aldehydes, but not 
aliphatic aldehydes 




Aldehyde 

dehydrogenase 

(NAD+) 


Wide specificity, 
including oxidation 
of D- 

glucuronolactone to 
D-glucarate 




4-carboxy-2- 

hydroxymuconat 

e-6- 

semialdehyde 
dehydrogenase 


Does not act on 
unsubstituted 
aliphatic or aromatic 
aldehydes or 
glucose NAD(+) can 
replace NADP(+), 
but with lower 
affinity 




Aldehyde 

dehydrogenase 

(NAD(P)+) 






4- 


With EC 4.2.1.87, 
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hydroxyphenylac 

etaldehyde 

dehydrogenase 


brings about the 
metabolism of 
octopamine in 
Pseudomonas 




Aldehyde 
oxidase 


Also oxidizes 
quinoline and 
pyridine derivatives 
May be identical 
with EC 1.1.3.22 




L-amino acid 
oxidase 






Amine oxidase 
(flavin- 
containing) 


Acts on primary 
amines, and usually 
also on secondary 
and tertiary amines 




Amine oxidase 
(copper- 
containing) 


A group of enzymes 
including those 
oxidizing primary 
amines, diamines 
and histamine One 
form of EC 1.3.1.15 
from rat kidney also 
catalyses this 
reaction 




Aralkylamine 
dehydrogenase 


Phenazine 
methosulfate can act 
as acceptor Acts on 
aromatic amines 
and, more slowly, on 
some long-chain 
aliphatic amines, but 
not on methylamine 
or ethylamine (cf EC 
1.4.99.3) 




Phenol O- 

methyltransferas 

e 


Acts on a wide 
variety of simple 
alkyl-, methoxy- and 
halo-phenols 




Tyramine N- 

methyltransferas 

e 


Has some activity on 

phenylethylamine 

analogs 




Phenylethanola 
mine N- 

methyltransferas 
e 


Acts on various 
phenylethanolamine 
s; converts 
noradrenalin into 
adrenalin 




Catechol O- 

methyltransferas 

e 


The mammalian 
enzymes act more 
rapidly on 
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catecholamines 

vulvvl IWIUI 1 III iwvJ 

such as adrenaline 
or noradrenaline 
than on catechols 




Glutamine N- 

phenylacetyltran 

sferase 






Rosmarinate 

I » \mf III W 1 1 1 1 ^p*4 W 

synthase 


Involved with EC 
1.1.1.237 in the 
biosynthesis of 
rosmarinic acid 




Hvdroxvmandelo 

i i y vi i v//\ y iii t* i iuviw 

nitrile 

glucosyltransfera 
se 


3 4- 

dihydroxymandelonit 
rile can also act as 
acceptor 




Aspartate 

aminotransferas 

e 


Also acts on L- 
tyrosine, L- 
phenylalanine and 
L-tryptophan. This 
activity can be 
formed from EC 
2.6.1.57 by 
controlled 
proteolysis 




Dihvd roxvohenvl 

ill y vi i wav i iwi i y ( 

alanine 

aminotransferas 
e 






Tyrosine 

aminotransferas 

e 


L-phenylalanine can 
act instead of L- 
tyrosine The 
mitochondrial 
enzyme may be 
identical with EC 
2.6.1.1 The three 
isoenzymic forms 
are interconverted 
by EC 3.4.22.4 




Aromatic amino 
acid transferase 


L-methionine can 
also act as donor, 
more slowly 
Oxaloacetate can 
act as acceptor 
Controlled 
proteolysis converts 
the enzyme to EC 
2.6.1.1 




Histidinol- 
phosphate 
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aminotransferas 
e 






Fumarylacetoac 
etase 


Also acts on other 
3,5- and 2,4-dioxo 
acids 




Acylpyruvate 
hydrolase 


Acts on 

formylpyruvate, 2,4- 
dioxopentanoate, 
2,4-dioxohexanoate 
and 2,4- 

dioxoheptanoate 




Tyrosine 
decarboxylase 


The bacterial 
enzyme also acts on 
3-hydroxytyrosine 
and, more slowly, on 
3- 

hydroxyphenylalanin 
e 




Aromatic-L- 

amino-acid 

decarboxylase 


Also acts on L- 
tryptophan, 5- 
hydroxy-L- 
tryptophan and 
di hydroxy- L- 
phenylalanine 
(DOPA) 




Gentisate 
decarboxylase 






5-oxo pent-3- 
ene-1,2,5- 
tricarboxylate 
decarboxylase 






Tyrosine phenol- 
lyase 


Also slowly 
catalyses pyruvate 
formation from D- 
tyrosine, S-methyl- 
L-cysteine, L- 
cysteine, L-serine 
and D-serine 




(S)- 

norcoclaurine 
synthase 


The reaction makes 
a 6-membered ring 
by forming a bond 
between C-6 of the 
3,4-dihydroxyphenyl 
group of the 
dopamine and C-1 
of the aldehyde in 
the imine formed 
between the 
substrates The 
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product is the 
precursor of the 
benzylisoquinoline 
alkaloids in plants 
Will also catalyse 
the reaction of 4-(2- 
aminoethyl)benzene 
-1 .2-diol + (3.4- 
dihydroxyphenyl)ace 
taldehyde to form 
(S)- 

norlaudanosoline, 
but this alkaloid has 
not been found to 
occur in plants 




Dihydroxyphenyl 
alanine 

ammonia-lyase 






Phenylalanine 
ammonia-lyase 


May also act on L- 
tyrosine 




Maleylacetoacet 
ate isomerase 


Also acts on 
maleylpyruvate 




Maleylpyruvate 
isomerase 






Phenylpyruvate 
tautomerase 


Also acts on other 
arylpyruvates 




5- 

carboxymethyl- 
2- 

hydroxymuconat 
e delta- 
isomerase 






Tyrosine 2,3- 
aminomutase 






Phenylacetate-- 
CoA ligase 


Also acts, more 
slowly, on acetate, 
propanoate and 
butanoate, but not 
on hydroxy 
derivatives of 
phenylacetate and 
related compounds 
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Useful promoters include those that are capable of facilitating preferential transcription, 
i.e. tissue-specific or developmentally regulated gene expression and being a component of 
facile systems to evaluate the metabolic/physiological state of a plant cell, tissue or organ. Many 
such promoters are included in this application. Operably linking a sequence to these promoters 
that can act as a reporter and inserting the construct into a plant allows detection of the 
preferential in plantar transcription. For example, the quantitative state of responses to 
environmental conditions can be detected by using a plant having a construct that contains a 
stress-inducible promoter linked to and controlling expression of a sequence encoding GFP. 
The greater the stress promoter is induced, the greater the levels of fluorescence from GFP will 
be produced and this provides a measure of the level of stress being expressed by the plant 
and/or the ability of the plant to respond internally to the stress. 

More specifically, using this system the activities of any metabolic pathway (catabolic 
and anabolic), stress-related pathways as on any plant gene repeated activity can be monitored. 
In addition, assays can be developed using this sentinel system to select for superior genotypes 
with greater yield characteristics or to select for plants with altered responses to chemical, 
herbicide, or plant growth regulators or to identify chemical, herbicides or plant growth 
regulators by their response on such sentinels. 

Specifically, a promoter that is regulated in plants in the desired way, is operably linked 
to a reporter such as GFP, RFP, etc., and the constructs are introduced into the plant of interest. 
The behavior of the reporter is monitored using technologies typically specific for that reporter. 
With GFP, RFP, etc., it could typically be by microscopy of whole plants, organs, tissues or cells 
under excitation by an appropriate wavelength of UV light. 
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BRIEF DESCRIPTION OF THE TABLES 



1 . Reference and Sequence Tables 

The sequences of exemplary SDFs and polypeptides corresponding to the coding 
sequences of the instant invention are described in the Reference and Sequence Tables 
(sometimes referred to as the REF and SEQ Tables. The Reference Table refers to a number 
of "Maximum Length Sequences" or "MLS." Each MLS corresponds to the longest cDNA 
obtained, either by cloning or by the prediction from genomic sequence. The sequence of the 
MLS is the cDNA sequence as described in the Av subsection of the Reference Table. 
The Reference Table includes the following information relating to each MLS: 

I. cDNA Sequence 

A. 5' UTR 

B. Coding Sequence 

C. 3' UTR 

II. Genomic Sequence 

A. Exons 

B. Introns 

C. Promoters 

III. Link of cDNA Sequences to Clone IDs 

IV. Multiple Transcription Start Sites 

V. Polypeptide Sequences 

A. Signal Peptide 

B. Domains 

C. Related Polypeptides 

VI. Related Polynucleotide Sequences 

I. cDNA SEQUENCE 

The Reference Table indicates which sequence in the Sequence Table represents the 
sequence of each MLS. The MLS sequence can comprise 5' and 3' UTR as well as coding 
sequences. In addition, specific cDNA clone numbers also are included in the Reference 
Table when the MLS sequence relates to a specific cDNA clone. 

A. 5 5 UTR 

The location of the 5' UTR can be determined by comparing the most 5' MLS 
sequence with the corresponding genomic sequence as indicated in the Reference Table. The 
sequence that matches, beginning at any of the transcriptional start sites and ending at the last 
nucleotide before any of the translational start sites corresponds to the 5' UTR. 

B. Coding Region 

The coding region is the sequence in any open reading frame found in the MLS. 
Coding regions of interest are indicated in the PolyP SEQ subsection of the Reference Table. 

C. 3' UTR 

The location of the 3' UTR can be determined by comparing the most 3' MLS 
sequence with the corresponding genomic sequence as indicated in the Reference Table. The 
sequence that matches, beginning at the translational stop site and ending at the last 
nucleotide of the MLS corresponds to the 3' UTR. 
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H. GENOMIC SEQUENCE 

Further, the Reference Table indicates the specific "gi" number of the genomic 
sequence if the sequence resides in a public databank. For each genomic sequence, 
Reference tables indicate which regions are included in the MLS. These regions can include 
the 5' and 3' UTRs as well as the coding sequence of the MLS. See, for example, the scheme 
below: 



Region 1 



Region 2 



Region 3 



| 5 ■ UTR | Exon 



■ | Exon | - 



■I Exon | 3 1 UTR 



I A I 

Promoter I Intron 

Translational 
Start Site 



Intron 



Stop Codon 



The Reference Table reports the first and last base of each region that are 
included in an MLS sequence. An example is shown below: 
gi No. 47000: 
37102 ... 37497 
37593 ... 37925 

The numbers indicate that the MLS contains the following sequences from two 
regions of gi No. 47000; a first region including bases 37102-37497, and a second region 
including bases 37593-37925. 

A. EXON SEQUENCES 
The location of the exons can be determined by comparing the sequence of the 
regions from the genomic sequences with the corresponding MLS sequence as indicated by 
the Reference Table. 



i. INITIAL EXON 
To determine the location of the initial exon, information from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 
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of the Reference Table is used. First, the polypeptide section will indicate where the 
translational start site is located in the MLS sequence. The MLS sequence can be matched to 
the genomic sequence that corresponds to the MLS. Based on the match between the MLS 
and corresponding genomic sequences , the location of the translational start site can be 
determined in one of the regions of the genomic sequence. The location of this translational 
start site is the start of the first exon. 

Generally, the last base of the exon of the corresponding genomic region, in which the 
translational start site was located, will represent the end of the initial exon. In some cases, 
the initial exon will end with a stop codon, when the initial exon is the only exon. 

In the case when sequences representing the MLS are in the positive strand of 
the corresponding genomic sequence, the last base will be a larger number than the first base. 
When the sequences representing the MLS are in the negative strand of the corresponding 
genomic sequence, then the last base will be a smaller number than the first base. 

ii. INTERNAL EXONS 
Except for the regions that comprise the 5' and 3' UTRs, initial exon, and terminal 
exon, the remaining genomic regions that match the MLS sequence are the internal exons. 
Specifically, the bases defining the boundaries of the remaining regions also define the 
intron/exon junctions of the internal exons. 



iii. TERMINAL EXON 
As with the initial exon, the location of the terminal exon is determined with 
information from the 

(1) polypeptide sequence section; 

(2) cDNA polynucleotide section; and 

(3) the genomic sequence section 

of the Reference Table. The polypeptide section will indicate where the stop codon is 
located in the MLS sequence. The MLS sequence can be matched to the corresponding 
genomic sequence. Based on the match between MLS and corresponding genomic 
sequences, the location of the stop codon can be determined in one of the regions of the 
genomic sequence. The location of this stop codon is the end of the terminal exon. 
Generally, the first base of the exon of the corresponding genomic region that matches the 
cDNA sequence, in which the stop codon was located, will represent the beginning of the 
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terminal exon. In some cases, the translational start site will represent the start of the 
terminal exon, which will be the only exon. 

In the case when the MLS sequences are in the positive strand of the 
corresponding genomic sequence, the last base will be a larger number than the.first base. 
When the MLS sequences are in the negative strand of the corresponding genomic sequence, 
then the last base will be a smaller number than the first base. 




B. INTRON SEQUENCES 
In addition, the introns corresponding to the MLS are defined by identifying the 
genomic sequence located between the regions where the genomic sequence comprises 
exons. Thus, introns are defined as starting one base downstream of a genomic region 
comprising an exon, and end one base upstream from a genomic region comprising an exon. 

C. PROMOTER SEQUENCES 

As indicated below, promoter sequences corresponding to the MLS are defined as 
sequences upstream of the first exon; more usually, as sequences upstream of the first of 
multiple transcription start sites; even more usually as sequences about 2,000 nucleotides 
upstream of the first of multiple transcription start sites. 

III. LINK of cDNA SEQUENCES to CLONE IDs 

As noted above, the Reference Table identifies the cDNA clone(s) that relate to each 
MLS. The MLS sequence can be longer than the sequences included in the cDNA clones. In 
such a case, the Reference Table indicates the region of the MLS that is included in the clone. 
If either the 5' or 3' termini of the cDNA clone sequence is the same as the MLS sequence, 
no mention will be made. 

IV. Multiple Transcription Start Sites 

Initiation of transcription can occur at a number of sites of the gene. The 
Reference Table indicates the possible multiple transcription sites for each gene. In the 
Reference Table, the location of the transcription start, sites can be either a positive or 
negative number. 
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The positions indicated by positive numbers refer to the transcription start sites as 
located in the MLS sequence. The negative numbers indicate the transcription start site 
within the genomic sequence that corresponds to the MLS. 

To determine the location of the transcription start sites with the negative 
numbers, the MLS sequence is aligned with the corresponding genomic sequence. In the 
instances when a public genomic sequence is referenced, the relevant corresponding genomic 
sequence can be found by direct reference to the nucleotide sequence indicated by the "gi" 
number shown in the public genomic DNA section of the Reference Table. When the 
position is a negative number, the transcription start site is located in the corresponding 
genomic sequence upstream of the base that matches the beginning of the MLS sequence in 
the alignment. The negative number is relative to the first base of the MLS sequence which 
matches the genomic sequence corresponding to the relevant "gi" number. 

In the instances when no public genomic DNA is referenced, the relevant 
nucleotide sequence for alignment is the nucleotide sequence associated with the amino acid 
sequence designated by "gi" number of the later PolyP SEQ subsection. 

V. Polypeptide Sequences 

The PolyP SEQ subsection lists SEQ ID NOs and Ceres SEQ ID NO for polypeptide 
sequences corresponding to the coding sequence of the MLS sequence and the location of the 
translational start site with the coding sequence of the MLS sequence. 

The MLS sequence can have multiple translational start sites and can be capable of 
producing more than one polypeptide sequence. 

A. Signal Peptide 

The Reference tables also indicate in subsection (B) the cleavage site of the putative 
signal peptide of the polypeptide corresponding to the coding sequence of the MLS sequence. 
Typically, signal peptide coding sequences comprise a sequence encoding the first residue of 
the polypeptide to the cleavage site residue. 
B. Domains 

Subsection (C) provides information regarding identified domains (where present) 
within the polypeptide and (where present) a name for the polypeptide domain. 

C. Related Polypeptides 
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Subsection (Dp) provides (where present) information concerning amino acid 
sequences that are found to be related and have some percentage of sequence identity to the 
polypeptide sequences of the Reference and Sequence Tables. These related sequences are 
identified by a "gi" number. 



VI. Related Polynucleotide Sequences 

Subsection (Dn) provides polynucleotide sequences (where present) that are related to 
and have some percentage of sequence identity to the MLS or corresponding genomic 
sequence. 



Abbreviation 


Description 


Max Len. Seq. 


Maximum Length Sequence 


rel to 


Related to 


Clone Ids 


Clone ED numbers 


Pub gDNA 


Public Genomic DNA 


gi No. 


gi number 


Gen. Seq. in Cdna 


Genomic Sequence in cDNA 

(Each region for a single gene prediction is 

listed on a separate line. 

In the case of multiple gene predictions, the 

group of regions relating to a single prediction 

are separated by a blank line) 


(Ac) cDNA SEQ 


cDNA sequence 


- Pat. Appln. SEQ ID NO 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO: 1673877 


Ceres SEQ ID NO: 


- SEQ # w. TSS 


Location within the cDNA sequence, SEQ ID 
NO:, of Transcription Start Sites which are 
listed below 


- Clone ID #: # -> # 


Clone ID comprises bases # to # of the cDNA 
Sequence 


PolyP SEQ 


Polypeptide Sequence 


- Pat. Appln. SEQ ID NO: 


Patent Application SEQ ID NO: 


- Ceres SEQ ID NO 


Ceres SEQ ID NO: 
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Abbreviation 


Description 


- Loc. SEQ ID NO: @ nt. 


Location of translational start site in cDNA of 
SEQ ID NO: at nucleotide number 


(C) Pred. PP Norn. & Annot. 


Nomination and Annotation of Domains within 
Predicted Polypeptide(s) 


- (Title) 


Name of Domain 


- Loc. SEQ ED NO #:#-># aa. 


Location of the domain within the polypeptide 
of SEQ ID NO: from # to # amino acid 
residues. 


(Dp) Rel. AA SEQ 


Related Amino Acid Sequences 


- Align. NO 


Alignment number 


- gi No 


Gi number 


- Desp. 


Description 


- % Idnt. 


Percent identity 


- Align. Len. 


Alignment Length 


- Loc. SEQ ID NO: # -> # aa 


Location within SEQ ID NO: from # to # 
amino acid residue. 



£g 2. Protein Group Table 

^ This table indicates groups of proteins that share a signature sequence (also 

p. referred to as a consensus sequence). The Protein group also referred to as the 

25 Ortholog group is named by the peptide ID with which all members were compared. 

Each group contains sequences that were included at the 10" 50 , 10~ 30 , and 10 -10 p-value 
cutoffs. For each group, the peptide ID and at which cutoff the peptide was included 
into the group. The same peptide ID may be included in the group three times as 
peptide ID 50, peptide ED 30 and peptide ID 10. The data indicates that peptide ID 
10 was included in the group when the threshold was either 10" 50 , 10~ 30 , or 10" °. All the 

peptide IDs that are followed by "50" were included in the protein group when the e- 
value cutoff was 10" 50 . All the peptide IDs that are followed by either "30" or "50" 
were included in the protein group when the threshold e-value was 10" 30 . All the 
peptide IDs that are followed by "10", "30" or "50" were included in the protein 
15 group when 10" 10 was used as the e-value cutoff. 

At the end of each protein group is a list of the consensus sequence that proteins share at the 10' 

50 , 10~ 30 , or 10" 10 . The consensus sequence contains both lower-case and upper-case letters. The 

upper-case letters represent the standard one-letter amino acid abbreviations. The lower case 

letters represent classes of amino acids: 
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• "t" refers to tiny amino acids, which are specifically alanine, glycine, serine and 
threonine. 

• "p"refers to polar amino acids, which are specifically, asparagine and glutamine 

• "n" refers to negatively charged amino acids, which are specifically, aspartic acid and 
glutamic acid 

• "+" refers to positively charged residues, which are specifically, lysine, arginine, and 
histidine 

"r" refers to aromatic residues, which are specifically, phenylalanine, tyrosine, and 
tryptophan, 

"a" refers to aliphatic residues, which are specifically, isoleucine, valine, leucine, and 
methonine 

3. Protein Group Matrix Table 

In addition to each consensus sequence, Applicants have generated a scoring matrix to 
provide further description of the consensus sequence. The first row of each matrix indicates 
the residue position in the consensus sequence. The matrix reports number of occurrences of 
all the amino acids that were found in the group members for every residue position of the 
signature sequence. The matrix also indicates for each residue position, how many different 
organisms were found to have a polypeptide in the group that included a residue at the 
relevant position. The last line of the matrix indicates all the amino acids that were found at 
each position of the consensus. 

4. MA diff Table 

The MA_diff Table presents the results of the differential expression experiments for 
the mRNAs, as reported by their corresponding cDNA ID number, that were differentially 
transcribed under a particular set of conditions as compared to a control sample. The cDNA 
ID numbers correspond to those utilized in the Reference and Sequence Tables. Increases in 
mRNA abundance levels in experimental plants versus the controls are denoted with the plus 
sign (+). Likewise, reductions in mRNA abundance levels in the experimental plants are 
denoted with the minus (-) sign. 
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The Table is organized according to each set of experimental conditions, which are 
denoted by the term "Expt ID:" followed by a particular number. The table below links each 
Expt ED with a short description of the experiment and the parameters. 

For each experiment ID a method of the normalization is specified. "Method: 2" 
represents normalization by median the goal of the method is to adjust the ratios by a factor 
so that the median of the ratio distribution is 1 . Method 3 is the normalization procedure 
conducted by Aglilent Technologies, Inc. Palo Alto, California, USA. 

The MA_diff Table also specifies the specific parameters and the experiment number 
(e.g. 107871) used in compiling the data. The experiment numbers are referenced in the 
appropriate utility/functions sections herein. The background threshold was set to 
"BKG_Threshold=X" to reduce the effect of the background on the signal. 

Finally, the Table includes reference to an "Organism_ID" number. This number 
refers to the cDNA spotted on the chip were similar to Arabidopsis thaliana (3769) sequences 
or whether the oligo used for the chips were similar to Zea mays (31 1987)sequences. 



5. MA diff (Experiment) Tables 



Example No. 


Experiment 


genome 


ExptID 


Value 


Parameter 


Units 




short name 












3ii 


3642-1 


Arabidopsis 


108512 


3746-1 


Plant Line 


Hours 


3n 


Arab 0.001% 
MeJA_l 


Arabidopsis 


108568 


Aerial 


Tissue 


Tissue 










0.001% MeJ 
A 


Treatment 


Compound 










1 


Timepoint 


Hours 


3n 


Arab 0.001% 
MeJA_l 


Arabidopsis 


108569 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










0.001% MeJ 
A 


Treatment 


Compound 


3j 


Arab_0.1uM_E 
pi-Brass_l 


Arabidopsis 


108580 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










0.1uM_Brassi 
no_Steroid 


Treatment 


Compound 


3j 


Arab_0.1uM_E 
pi-Brass_l 


Arabidopsis 


108581 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










0.1uM_Brassi 


Treatment 


Compound 
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no_Steroid 






3g 


Arab lOOuM 
ABA_1 


Arabidopsis 


108560 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










100uM_ABA 


Treatment 


Compound 


3g 


Arab lOOuM 
ABA_1 


Arabidopsis 


108561 


Aerial 


Tissue 


Tissue 










100uM_ABA 


Treatment 


Compound 










6 


Timepoint 


Hours 


31 


Arab lOOuM 
BA_1 


Arabidopsis 


108566 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










100uM_BA 


Treatment 


Compound 


31 


Arab lOOuM 
BA_1 


Arabidopsis 


108567 


Aerial 


Tissue 


Tissue 










lOOuM BA 


Treatment 


Compound 










6 


Timepoint 


Hours 


3k 


Arab lOOuM 
GA3_1 


Arabidopsis 


108562 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










lOOuM GA3 


Treatment 


Compound 


3k 


Arab lOOuM 
GA3_1 


Arabidopsis 


108563 


Aerial 


Tissue 


Tissue 










lOOuM GA3 


Treatment 


Compound 










6 


Timepoint 


Hours 


3h 


Arab lOOuM 
NAA_1 


Arabidopsis 


108564 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










100uM_NAA 


Treatment 


Compound 


3h 


Arab lOOuM 
NAA_1 


Arabidopsis 


108565 


Aerial 


Tissue 


Tissue 










100uM_NAA 


Treatment 


Compound 










6 


Timepoint 


Hours 


3r 


Arab 20% PE 
G_l 


Arabidopsis 


108570 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










20%PEG 


Treatment 


Compound 


3r 


Arab 20% PE 
G_l 


Arabidopsis 


108571 


Aerial 


Tissue 


Tissue 










20%PEG 


Treatment 


Compound 










6 


Timepoint 


Hours 


3o 


Arab 2mM SA 
J 


Arabidopsis 


108586 


Aerial 


Tissue 


Tissue 










2mM_SA 


Treatment 


Compound 










1 


Timepoint 


Hours 


3o 


Arab 2mM SA 
_1 


Arabidopsis 


108587 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 
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2mM_SA 


Treatment 


Compound 


3u 


Arab_5mM_H2 
02_1 


Arabidopsis 


108582 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










5mM_H202 


Treatment 


Compound 


3u 


Arab 5mM H2 
02_1 


Arabidopsis 


108583 


Aerial 


Tissue 


Tissue 










5mM_H202 


Treatment 


Compound 










6 


Timepoint 


Hours 


3v 


Arab_5mM_Na 
NP_1 


Arabidopsis 


108584 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










5mM_NaNP 


Treatment 


Compound 


3v 


Arab 5mM Na 
NP_1 


Arabidopsis 


108585 


Aerial 


Tissue 


Tissue 










5mM_NaNP 


Treatment 


Compound 










6 


Timepoint 


Hours 


3t 


Arab_Cold_l 


Arabidopsis 


108578 


Aerial 


Tissue 


Tissue 










Cold 


Treatment 


Compound 










1 


Timepoint 


Hours 


3t 


Arab_Cold_l 


Arabidopsis 


108579 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










Cold 


Treatment 


Compound 


3g 


Arab Drought 
1 


Arabidopsis 


108572 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Drought 


Treatment 


Compound 


3g 


Arab Drought 
1 


Arabidopsis 


108573 


Aerial 


Tissue 


Tissue 










Drought 


Treatment 


Compound 










6 


Timepoint 


Hours 


3s 


Arab_Heat_l 


Arabidopsis 


108576 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Heat (42 deg 
Q 


Treatment 


Compound 


3s 


Arab_Heat_l 


Arabidopsis 


108577 


Aerial 


Tissue 


Tissue 










Heat (42 deg 
C) 


Treatment 


Compound 










6 


Timepoint 


Hours 


3aa (ovule) 


Arab_Ler- 
pi_ovule_l 


Arabidopsis 


108595 


Ler_pi 


Plant Line 


Hours 










Ovule 


Tissue 


Tissue 


3b 


Arab_Ler- 
rhl_root_l 


Arabidopsis 


108594 


Lerjrhl 


Plant Line 


Hours 










Root 


Tissue 


Tissue 


31 


Arab N03 H- 
to-L_l 


Arabidopsis 


108592 


Aerial 


Tissue 


Tissue 










Low Nitrogen 


Treatment 


Compound 
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12 


Timepoint 


Hours 


31 


Arab N03 H- 
to-L_l 


Arabidopsis 


108593 


Aerial 


Tissue 


Tissue 










24 


Timepoint 


Hours 










Low Nitrogen 


Treatment 


Compound 


31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108588 


Aerial 


Tissue 


Tissue 










2 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 


31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108589 


Aerial 


Tissue 


Tissue 










Nitrogen 


Treatment 


Compound 










6 


Timepoint 


Hours 


31 


Arab N03 L- 
to-HJ 


Arabidopsis 


108590 


Aerial 


Tissue 


Tissue 










9 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 


31 


Arab N03 L- 
to-H_l 


Arabidopsis 


108591 


Aerial 


Tissue 


Tissue 










Nitrogen 


Treatment 


Compound 










12 


Timepoint 


Hours 


3p 


Arab Woundin 
g_l 


Arabidopsis 


108574 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Wounding 


Treatment 


Compound 


3p 


Arab Woundin 
g_l 


Arabidopsis 


108575 


Aerial 


Tissue 


Tissue 










Wounding 


Treatment 


Compound 










6 


Timepoint 


Hours 


3o 


Columbia/CS37 
26 flower SA 


Arabidopsis 


108475 


Columbia 


species 


Hours 










SA 


Treatment 


Compound 










5 weeks 


Timepoint 


Hours 


3o 


Columbia/CS37 
26 flower SA 


Arabidopsis 


108476 


CS3726 


species 


Hours 










5 weeks 


Timepoint 


Hours 










SA 


Treatment 


Compound 


3p 


Corn_0.001Per 
cent_MeJA 


Zea Mays 


108555 


Aerial 


Tissue 


Tissue 










24 


Timepoint 


Hours 










0.001% MeJ 
A 


Treatment 


Compound 


3j 


Corn_0.1uM_B 
rassino_Steroid 


Zea Mays 


108557 


24 


Timepoint 


Hours 










Aerial 


Tissue 


Tissue 










0.1uM_Brassi 
no_Steroid 


Treatment 


Compound 


3g 


Corn_100uM_ 


Zea Mays 


108513 


Aerial 


Tissue 


Tissue 
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ABA 




















ABA 


Treatment 


Compound 










6 


Timepoint 


Hours 


3g 


Corn lOOuM 
ABA 


Zea Mays 


108597 


Aerial 


Tissue 


Tissue 










24 


Timepoint 


Hours 










100uM_ABA 


Treatment 


Compound 


3i 


Corn lOOuM 
BA 


Zea Mays 


108517 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










BA 


Treatment 


Compound 


3k 


Corn lOOuM 
GA3 


Zea Mays 


108519 


Aerial 


Tissue 


Tissue 










lOOuM 

Giberillic 

Acid 


Treatment 


Compound 










1 


Timepoint 


Hours 


3k 


Corn lOOuM 
GA3 


Zea Mays 


108520 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










lOOuM 

Giberillic 

Acid 


Treatment 


Compound 


3k 


Corn lOOuM 
GA3 


Zea Mays 


108521 


Aerial 


Tissue 


Tissue 










lOOuM 

Giberillic 

Acid 


Treatment 


Compound 










12 


Timepoint 


Hours 


3h 


Corn lOOuM 
NAA 


Zea Mays 


108516 


Aerial 


Tissue 


Tissue 










NAA 


Treatment 


Compound 










6 


Timepoint 


Hours 


3h 


Corn lOOuM 
NAA 


Zea Mays 


108554 


Aerial 


Tissue 


Tissue 










24 


Timepoint 


Hours 










NAA 


Treatment 


Compound 


3hh 


Corn 1400- 
6/S-17 


Zea Mays 


108598 


Shoot apices 


Tissue 


Tissue 


3r 


Corn 150mM_ 
NaCl 


Zea Mays 


108541 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










150mM_NaCl 


Treatment 


Compound 


3r 


Corn 150mM 
NaCl 


Zea Mays 


108542 


Aerial 


Tissue 


Tissue 










150mM_NaCl 


Treatment 


Compound 










6 


Timepoint 


Hours 


3r 


Corn_150mM_ 


Zea Mays 


108553 


Aerial 


Tissue 


Tissue 



Page 637 of 772 



Attorney DKt. 

Client Dkt. 80090.005 





NaCl 




















24 


Timepoint 


Hours 










150mM_NaCl 


Treatment 


Compound 


3r 


Corn 20% PE 
G 


Zea Mays 


108539 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










20%PEG 


Treatment 


Compound 


3r 


Corn 20% PE 
G 


Zea Mays 


108540 


Aerial 


Tissue 


Tissue 










20%PEG 


Treatment 


Compound 










6 


Timepoint 


Hours 


3o 


Corn_2mM_SA 


Zea Mays 


108515 


Aerial 


Tissue 


Tissue 










SA 


Treatment 


Compound 










12 


Timepoint 


Hours 


3o 


Corn_2mM_SA 


Zea Mays 


108552 


Aerial 


Tissue 


Tissue 










SA 


Treatment 


Compound 










24 


Timepoint 


Hours 


3u 


Corn 5mM H2 
02 


Zea Mays 


108537 


Aerial 


Tissue 


Tissue 










H202 


Treatment 


Compound 










1 


Timepoint 


Hours 


3u 


Corn 5mM H2 
02 


Zea Mays 


108538 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










H202 


Treatment 


Compound 


3u 


Corn 5mM H2 
02 


Zea Mays 


108558 


Aerial 


Tissue 


Tissue 










24 


Timepoint 


Hours 










H202 


Treatment 


Compound 


3v 


Corn 5mM N 
0 


Zea Mays 


108526 


Aerial 


Tissue 


Tissue 










NO 


Treatment 


Compound 










1 


Timepoint 


Hours 


3v 


Corn 5mM N 
0 


Zea Mays 


108527 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










NO 


Treatment 


Compound 


3v 


Corn 5mM N 
0 


Zea Mays 


108559 


Aerial 


Tissue 


Tissue 










12 


Timepoint 


Hours 










NO 


Treatment 


Compound 


3t 


Corn_Cold 


Zea Mays 


108533 


Aerial 


Tissue 


Tissue 










1 


Timepoint 


Hours 










Cold 


Treatment 


Compound 


3t 


Corn_Cold 


Zea Mays 


108534 


Aerial 


Tissue 


Tissue 










Cold 


Treatment 


Compound 










6 


Timepoint 


Hours 
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3q 


Corn_Drought 


Zea Mays 


108502 


Drought 


Treatment 


Compound 












1 


Timepoint 


Hours 




3q 


Corn_Drought 


Zea Mays 


108503 


Drought 


Treatment 


Compound 












6 


Timepoint 


Hours 




3q 


Com_Drought 


Zea Mays 


108504 


Drought 


Treatment 


Compound 












12 


Timepoint 


Hours 




3q 


Corn_Drought 


Zea Mays 


108556 


Drought 


Treatment 


Compound 












24 


Timepoint 


Hours 




3s 


Corn_Heat 


Zea Mays 


108522 


Aerial 


Tissue 


Tissue 












1 


Timepoint 


Hours 












Heat (42 deg 


Treatment 


Compound 












C) 








3s 


Corn_Heat 


Zea Mays 


108523 


Aerial 


Tissue 


Tissue 












6 


Timepoint 


Hours 












Heat (42 deg 
C) 


Treatment 


Compound 




3gg 


Corn_Imbibed 
Seeds 


Zea Mays 


108518 


Imbibed 


Treatment 


Compound 












4 


Age 


days old 


o 

™: 










Roots 


Tissue 


Tissue 


ae J 


3gg 


Corn_Imbibed 


Zea Mays 


108528 


Imbibed 


Treatment 


Compound 






Seeds 






















Aerial 


Tissue 


Tissue 


fU 










5 


Age 


days old 


w 


3gg 


Corn_Imbibed 


Zea Mays 


108529 


Imbibed 


Treatment 


Compound 




Seeds 












S 










5 


Age 


days old 


ffi 










Root 


Tissue 


Tissue 


S w 


3gg 


Corn_Imbibed 


Zea Mays 


108530 


Imbibed 


Treatment 


Compound 




Seeds 












r: 










Aerial 


Tissue 


Tissue 


u. 










6 


Age 


days old 




3gg 


Corn_Imbibed 


Zea Mays 


108531 


Imbibed 


Treatment 


Compound ■ 




Seeds 






















6 


Age 


days old 












root 


Tissue 


Tissue 




3gg 


Corn_Imbibed 
Seeds 


Zea Mays 


108545 


Imbibed 


Treatment 


Compound 












Aerial 


Tissue 


Tissue 












3 


Age 


days old 




3gg 


Corn_Imbibed 
Seeds 


Zea Mays 


108546 


Imbibed 


Treatment 


Compound 












3 


Age 


days old 












Root 


Tissue 


Tissue 




3gg 


Corn_Imbibed 
Seeds 


Zea Mays 


108547 


Imbibed 


Treatment 


Compound 












Aerial 


Tissue 


Tissue 
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4 


Age 


days old 


3gg 


Corn_Imbibed_ 
Embryo_Endos 
perm 


Zea Mays 


108543 


2 


Age 


days old 










Imbibed 


Treatment 


Compound 










Embryo 


Tissue 


Tissue 


3gg 


Corn_Imbibed_ 
Embryo_Endos 
perm 


Zea Mays 


108544 


2 


Age 


days old 










Endosperm 


Tissue 


Tissue 










Imbibed 


Treatment 


Compound 


3ee 


Corn_Meristem 


Zea Mays 


108535 


Root 
Meristem 


Tissue 


Tissue 










192 


Timepoint 


Hours 


3ee 


Corn_Meristem 


Zea Mays 


108536 


Shoot 
Meristem 


Tissue 


Tissue 










192 


Timepoint 


Hours 


3n 


Corn_Nitrogen 
_H_to_L 


Zea Mays 


108532 


Roots 


Tissue 


Tissue 










Low Nitrogen 


Treatment 


Compound 










16 


Timepoint 


Hours 


3n 


Corn Nitrogen 
_H_to_L 


Zea Mays 


108548 


Root 


Tissue 


Tissue 










Low Nitrogen 


Treatment 


Compound 










4 


Timepoint 


Hours 


3m 


CornJNfitrogen 
_L_to_H 


Zea Mays 


108549 


Aerial 


Tissue 


Tissue 










0.166 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 


3m 


Corn Nitrogen 
_L_to_H 


Zea Mays 


108550 


Aerial 


Tissue 


Tissue 










Nitrogen 


Treatment 


Compound 










1.5 


Timepoint 


Hours 


3m 


Corn_Nitrogen 
_L_to_H 


Zea Mays 


108551 


Aerial 


Tissue 


Tissue 










3 


Timepoint 


Hours 










Nitrogen 


Treatment 


Compound 


3ff 


Corn_RTl 


Zea Mays 


108599 


Unknown 


Plant Line 


Hours 










Root 


Tissue 


Tissue 


3p 


Corn_Woundin 
g 


Zea Mays 


108524 


Aerial 


Tissue 


Tissue 










Wounding 


Treatment 


Compound 










1 


Timepoint 


Hours 


3p 


Corn_Woundin 
g 


Zea Mays 


108525 


Aerial 


Tissue 


Tissue 










6 


Timepoint 


Hours 










Wounding 


Treatment 


Compound 


3g 


Drought_Flowe 


Arabidopsis 


108473 


Flowers 


Tissue 


Tissue 
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rs 




















7d 


Timepoint 


Hours 










Drought 


Treatment 


Compound 


3g 


Drought_Flowe 
rs 


Arabidopsis 


108474 


Flowers 


Tissue 


Tissue 










Drought 


Treatment 


Compound 










8d(ld- 
post_re- 
watering) 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108484 


1 


Timepoint 


Hours 










1 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108485 


6 


Timepoint 


Hours 










6 


Timepoint 


Hours 


3k 


GA Treated 


Arabidopsis 


108486 


12 


Timepoint 


Hours 










12 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108461 


Day 1 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108462 


Day 2 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108463 


Day 3 


Timepoint 


Hours 


3e 


Germinating 
Seeds 


Arabidopsis 


108464 


Day 4 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108465 


Round up 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108466 


Trimec 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108467 


Finale 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide V3.1 


Arabidopsis 


108468 


Glean 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107871 


Finale 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107876 


Finale 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107881 


Glean 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107886 


Trimec 


Treatment 


Compound 










4 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107891 


Trimec 


Treatment 


Compound 










12 


Timepoint 


Hours 


3bb 


Herbicide_v2 


Arabidopsis 


107896 


Round-up 


Treatment 


Compound 










4 


Timepoint 


Hours 


3d 


Trichome 

Inflorescences 

expt 


Arabidopsis 


108452 


Hairy 

Influorescenc 
e#l 


Tissue 


Tissue 


3o 


SA treatment_l 


Arabidopsis 


108471 


Columbia 


Species 


Hours 
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hour 






















1 


Timepoint 


Hours 












SA 


Treatment 


Compound 




3o 


SA treatment_l 


Arabidopsis 


108472 


CS3726 


Species 


Hours 






hour 






















1 


Timepoint 


Hours 












SA 


Treatment 


Compound 




3o 


SA treatment_4 


Arabidopsis 


108469 


Columbia 


Species 


Hours 






hour 






















4 


Timepoint 


Hours 












SA 


Treatment 


Compound 




3o 


SA treatment_4 


Arabidopsis 


108470 


CS3726 


Species 


Hours 






hour 






















SA 


Treatment 


Compound 












4 


Timepoint 


Hours 




3o 


SA 


Arabidopsis 


107953 


50 


Probe 


%of 






treatment_AJ 








Amount 


Standard 
















Amount 












SA 


Treatment 


Compound 












24 


Timepoint 


Hours 












Clontech 


Probe Type 


Probe 
















method 




3o 


SA 


Arabidopsis 


107960 


50 


Probe 


%of 


T; : 

; k : 




treatment_AJ 








Amount 


Standard 


: PI 














Amount 


0*1 










SA 


Treatment 


Compound 












24 


Timepoint 


Hours 


t : 

Pi 2 -. 
! : : 










Operon 


Probe Type 


Probe 


E tr 














method 




3o 


SA_treatment 


Arabidopsis 


108443 


SA 


Treatment 


Compound 


.ess. 
: : 




24 hour 






















24 


Timepoint 


Hours 




3o 


SA_treatment 6 


Arabidopsis 


108440 


SA treatment 


Treatment 


Compound 






hour 






6 hour 
















CS3726 


species 


Hours 




3o 


SA_treatment 6 


Arabidopsis 


108441 


SA treatment 


Treatment 


Compound 






hour 






6 hour 
















Columbia 


species 


Hours 




31 


Nitrogen High 


Arabidopsis 


108454 


10 min 


Timepoint 


Hours 






transition to 
















Low 














31 


Nitrogen High 


Arabidopsis 


108455 


1 hr 


Timepoint 


Hours 






transition to 
















Low 














3j 


BR_Shoot 


Arabidopsis 


108478 


dwf4-l 


Plant Line 


Hours 






Apices Expt 














3j 


BR_Shoot 


Arabidopsis 


108479 


AOD4-4 


Plant Line 


Hours 






Apices Expt 
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4: 



3j 


BR_Shoot 
Apices Expt 


Arabidopsis 


108480 


Ws-2 


Plant Line 


Hours 










BL 


Treatment 


Compound 


3j 


BR_Shoot 
Apices Expt 


Arabidopsis 


108481 


Ws-2 


Plant Line 


Hours 










BRZ 


Treatment 


Compound 


3jj 


Tissue Specific 
Expression 


Arabidopsis 


108429 


green flower 


Tissue 


Tissue 










operon 


Probe Type 


Probe 
method 










50 


Probe 
Amount 


%of 

Standard 

Amount 


3jj 


Tissue Specific 
Expression 


Arabidopsis 


108430 


white flower 


Tissue 


Tissue 










50 


Probe 
Amount 


%of 

Standard 

Amount 










operon 


Probe Type 


Probe 
method 


3jj 


Tissue Specific 
Expression 


Arabidopsis 


108431 


flowers (bud) 


Tissue 


Tissue 










operon 


Probe Type 


Probe 
method 










50 


Probe 
Amount 


%of 

Standard 

Amount 


3c 


Tissue Specific 
Expression 


Arabidopsis 


108436 


5-10mm 
siliques 


Tissue 


Tissue 










33 


Probe 
Amount 


%of 

Standard 

Amount 










operon 


Probe Type 


Probe 
method 


3c 


Tissue Specific 
Expression 


Arabidopsis 


108437 


<5mm 
siliques 


Tissue 


Tissue 










operon 


Probe Type 


Probe 
method 










33 


Probe 
Amount 


%of 

Standard 

Amount 


3c 


Tissue Specific 
Expression 


Arabidopsis 


108438 


5wk siliques 


Tissue 


Tissue 










33 


Probe 
Amount 


%of 

Standard 

Amount 










operon 


Probe Type 


Probe 
method 
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3a 


Tissue Specific 
Expression 


Arabidopsis 


108439 


Roots (2wk) 


Tissue 


Tissue 












operon 


Probe Type 


Probe 
method 












33 


Probe 
Amount 


%of 

Standard 

Amount 




3c 


Tissue Specific 
Expression 


Arabidopsis 


108497 


3 week 

Rossette 

leaves 


Tissue 


Tissue 












100 


Probe 
Amount 


%of 

Standard 

Amount 












operon 


Probe Type 


Probe 
method 




3c 


Tissue Specific 
Expression 


Arabidopsis 


108498 


3 -week stems 


Tissue 


Tissue 












operon 


Probe Type 


Probe 
method 


—Z. 










100 


Probe 


%of 


. f : : 












Amount 


Standard 


. d^: 

xiL.: 














Amount 




3dd 


U.A.E. 


Arabidopsis 


108451 


13B12 


Plant Line 


Hours 


! L! 




Knockout 














3q 


Ws Arabidopsis 
Drought 2 days 


Arabidopsis 


108477 


stems and 
leaves 


Tissue 


Tissue 












2 days 


Timepoint 


Hours 




3q 


Ws Arabidopsis 


Arabidopsis 


108482 


4 days 


Timepoint 


Hours 


bp 




Drought 4 days 












s t : 
s ti- 


3q 


Ws Arabidopsis 
Drought 6 days 


Arabidopsis 


108483 


6 days 


Timepoint 


Hours 


ll : 


3cc 


ap2-floral buds 


Arabidopsis 


108501 


ap2 (Ler.) 


Plant Line 


Hours 












floral buds 


Tissue 


Tissue 




3m 


nitrogen-seed 
set 


Arabidopsis 


108487 


0.5 


Timepoint 


Hours 




3m 


nitrogen-seed 
set 


Arabidopsis 


108488 


2 


Timepoint 


Hours 




3m 


nitrogen-seed 
set 


Arabidopsis 


108489 


4 


Timepoint 


Hours 




3b 


rhl mutant2 


Arabidopsis 


108433 


mutant 


Tissue 


Tissue 




3ee 


root tips 


Arabidopsis 


108434 


root tips 


Tissue 


Tissue 




3f 


stm mutants 


Arabidopsis 


108435 


stem 


Tissue 


Tissue 






Aluminum 




SMD 
7304, 
SMD 7305 












Axel 




SMD 
6654, 
SMD 6655 
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Cadium 




SMD 
7427, 

SMD 7428 










Cauliflower 




SMD 
5329, 

SMD 5330 










Chloroplast 




SMD 
8093, 
SMD 8094 










Circadian 




SMD 
2344, 
SMD 
2359, 
SMD 
2361, 
SMD 
2362, 
SMD 
2363, 
SMD 
2364, 
SMD 
2365, 
SMD 
2366, 
SMD 
2367, 
SMD 
2368, 
SMD 3242 










C02 




SMD7561, 

SMD 

7562, 

SMD 

7261, 

SMD 

7263, 

SMD 

3710, 

SMD 

4649, 

SMD 4650 










Disease 




SMD 
7342, 
SMD 7343 










reactive oxygen 




SMD 
7523 
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Iron 




SMD 
7114, 
SMD 
7115, 
SMD 7125 










defense 




SMD 
8031, 
SMD 8032 










Mitchondria- 

Electron 

Transport 




SMD 
8061, 
SMD 8063 










NAA 




SMD 
3743, 
SMD 
3749, 
SMD 
6338, 
SMD 6339 










Nitrogen 




SMD 
3787, 
SMD 3789 










Phototropism 




SMD 
4188, 
SMD 
6617, 
SMD 6619 










Shade 




SMD 
8130, 

SMD 7230 










Sqn 




SMD 
7133, 

SMD 7137 










Sulfur 




SMD 
8034, 

SMD 8035 










Wounding 




SMD 
3714, 
SMD 3715 










Zinc 




SMD 
7310, 
SMD 7311 









6. MA Clusters Table 

Microarray data was clustered using one of two methods: "complete linkage" or 
"nearest neighbor" analysis. These clustering methods are described in more detail elsewhere 
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herein. The results of the clustering analysis are presented in the MA_clust table. The table 
is organized as follows: 

"METHOD" refers to a method number which clustering method used. 
"CL_METHOD_TYPE=TRUE" refers to complete linkage method. 
5 <c NN_METHOD_TYPE=TRUE" refers to the nearest neighbor method. 

"FULL_NN_METHOD_TYPE=TRUE" refers to the nearest neighbor method, where no size 
limitation was placed on the cluster. 

"PARAMETERS" refers to the parameters utilized for the analysis. The nature of 
these is also described in more detail elsewhere herein. 
10 "ORGANISM" refers to the cDNA spotted on the chip were similar to Arabidopsis 

thaliana (3769) sequences or whether the oligo used for the chips were similar to Zea mays 
(311 987)sequences. 

Each cluster or group of cDNA is identified by a "Group #", following which are the 
C individual cDNA_Ids that are a member of that Group 

ft 

"P 7. Knock-in Table 



tar 



p ; j The Knock-In Table presents the results of knock-in experiments wherein plants are 

jji 

grown from tissues transformed with a marker gene-containing insert and phenotypes are 
^ ascertained from the transformed plants. Each section of the Table relating to information on 
lo a new transformant begins with a heading "Knock-in phenotype in gene (cDNA_id):" 
followed by a number which represents the Ceres internal code for a proprietary cDNA 
sequence. The described transformant was prepared by procedures described herein, wherein 
the identified Ceres proprietary cDNA_id (corresponding to the cDNA_id in the Reference 
and Sequence Tables) was interrupted by the marker gene-containing insert.The following 
25 information is presented for each section. 

• Parent plants used in cross - presents the id numbers of the parent plants 
which were crossed to produce the Fl generation plant for which a phenotype 
is described. The parent plant with the promoter is described by a plant line 
descriptor. 

30 • Clone ID - presents the clone number of the Ceres proprietary clone which 

was the source of the cDNA_id. 

• Phenotype ID - represents an internal identification code. 
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• Unique FI plant ID - represents the internal code for the Fl plant for which a 
phenotype is described. 

• Assay - presents the type of growth analyzed (e.g. soil gross morphology), 
followed by the assay name which corresponds to the type/location of the 
tissue that was obsereved., the name of the assay conducted for which the 
result provided the identified phenotype. 

• Phenotype - describes the phenotype noted for the Fl generation transformant. 

• Notes - may provide additional information on the described phenotype for 
the transformant. 



Each knock-in representing a transformant with an interruption in the identified 
cDNA_id may be correlated with more than one identified phenotype. 

8. Knock-out Table 

The Knock-Out Table presents the results of knock-out experiments wherein plants 
are grown from tissues transformed with a marker gene-containing insert wherein phenotypes 
are ascertained from the transformed plants. Each section of the Table relating to information 
on a new transformant begins with a heading "tail id:" representing an internal code. The 
following information is presented for each section. 

br - provides another internal code for the experiment. 

Phenotype_ id - provides an identification number for the particular phenotype 

identified for the transformant. 

assay - identifies the assay procedure utilized in the experiment to identify a 
phenotype for the transformant. 

phenotype - represents an internatl identification code. 

ratio - represents a segregation ratio. 

notes - lists any notes relevant to the identified phenotype. 

Knock-out in-genes - Identifies the genes in which the tag has inserted 



6) 
7) 
8) 
9) 



the less than 501 upstream of the transcriptional start site; 
less than 701 upstream of the translational initiation codon; 
between the translational initiation and termination codons of the gene, 
less than 301 downstream of the translational stop codon; or 
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10) less than 151 downstream of a transcriptional termination site or a 



In this table the gene is identified by its cDNA ID number, the Ceres SEQ ID that is 
indicated in the (Ac) portion of the Reference tables. For each cDNA_id, the 
following information is provided: 



the cDNA_id number. 

in parenthesis, the cluster number of which the identified cDNA is a 
member. 

the "gDNA_Insert pos" representing the position of the insert in the 
corresponding gDNA sequence 

the gi nimber refers to the TIGR chromosome sequences for 
Arabidopsis. 



by cDNA_id which are the same as those identified in the Reference and Sequence 
Tables) which are closest in position to the insert, both upstream and downstream 
from the insert. For each cDNA_id, the following information is provided: 



In the first parentheses, R indicates that the gene is to the right of the 
tag, L indicates that the gene is to right of the tag as the sequences is 
read left to right 
the cDNA_id number 

in next parentheses, the cluster number of which the identified cDNA 
is a member. 

the distance (in number of nucleotides) of the insert is upstream of the 
start of the gene annotation as described in the Reference Tables or 
downstream at the end the gene annotation. 

the "gDNA_Insert pos" representing the position of the insert in the 
corresponding gDNA sequence 

the gi nimber refers to the TIGR chromosome sequences for 
Arabidopsis. 



gene. 



Knock-out out of-genes: 



Identifies the Ceres cDNA proprietary sequences (noted 



9. 



Protein Domain Table 
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The Protein Domain table provides details concerning the protein domains noted in 
the Reference Table. The majority of the protein domain descriptions given in the Protein 
Domain Table are obtained from Prosite, (http//www.expasy.ch/prosite/), and Pfam, 
(http//pfam.wustl.edi^rowse.shtml). Each description in The Table begins with the pfam and 
Prosite identifying numbers, the full name of the domain, and a detailed description, 
including biological and in vivo implications/functions for the domain, references which 
further describe such implications/functions, and references that describe tests/assays to 
measure the implications/functions. 

10. Single Gene Functions & Utilities Table 

The Single Gene Functions & Utilities Table describes particular utilities/functions of 
interest for individual genes. The Table identifies the cDNA_ID of interest, correlates to that 
cDNA the relevant phenotype, protein domain and microarray/differential expression data. 
The final column of the Table identifies the utilities/functions of particular interest for the 
identified cDNA. 

1 1 . Cluster Functions & Utilities Table 

The Cluster Functions & Utilities Table describes particular utilities/functions of 
interest for identified clusters of genes. The Table provides the following information: 
Record # - an internal identifier. 

Goup - identifies the group of clusters of interest, wherein each group is identified 
with the same utilities/funcions as set forth in the right-hand most column. 

CDNA - identifies the cDNA of interest with the noted utility/function. 
CDNA_Cluster - identifies the cDNA Cluster ID of interest. 
Gi No - refers to the public genomic sequence that matches to the cDNA 
NR Hit - refers to the most relevant protein domain for the cDNA of interest. 
Pfam and Pfam Desc - provide the protein domain name. 

Notes/Annotations - provides some notes relevant to the data/information analysis. 
Utilities/Functions - this rightmost column identifies utilities/functions of particular 
interest for the group of cDNAs and clusters. 

12. cDNA Clusters Table 
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The cDNA_Clusters Table correlates the Ceres cDNA_ED nos. (in numerical order) 
with the relevant cDNA cluster which contains each cDNA_ID. 

13. Stanford old new cDNA map Table 

During the course of the experiments reported herein, some of the cDNA sequences 
were assigned new Ceres internal cDNA_id numbers. The cDNA_map Table provides a list 
of the original "old" cDNA_ids and correlates those id numbers with any new cDNA_id 
which may have been assigned. Thus, any "old" and "new" cDNA ids which are on the same 
line in the Table are, in fact, the same sequence. 

14. gb Only Peptides Table 

In the Protein Group table, a number of proteins encoded by Genbank predictions are 
included. These proteins were referenced with a peptide ID number. The peptide ID number 
is linked to the amino acid sequence of the Genbank prediction in this table. 

15. Stanford Old New cDNA Table 

During the course of the experiments reported herein, some of the cDNA sequences 
utilized in the Stanford Microarray differential expression analysis experiments were 
assigned new Ceres internal cDNA_id numbers. The Stanford_old_new_cDNA Table 
provides a list of the original "old" cDNA_ids and correlates those id numbers with any new 
cDNA_id which may have been assigned. Thus, any "old" and "new" cDNA ids- which are on 
the same line in the Table are, in fact, the same sequence. 

16. Enhanced Amino Table 

This table list the peptide IDs of polypeptides with enhanced amino acid content. The 
table list the peptide ID following with the single letter code of the amino acid that is 
enhanced. The table also includes a frequency that the amino acid occurred. The frequency 
was calculated by dividing the total number of the desired amino acid indicated in the column 
by the number of residues in the peptide. For example, if amino acid A, occurred 50 times in 
a polypeptide that is 100 amino acid long, the frequency would be 50 divided by 100 or 0.5. 

17. Stanford old new cDNA map Table* *Keep only for patent #3 
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During the course of the experiments reported herein, some of the cDNA sequences were 
assigned new Ceres internal cDNA_id numbers. The docket_80090_101_cDNA_map 
provides a list of the original "old" cDNA_ids in the Reference and Sequence tables and 
correlates those id numbers with any new cDNA_id which may have been assigned and 
utilized in the remaining tables. Thus, any "old" and "new" cDNA ids which are on the same 
line in the Table are, in fact, the same sequence. 
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HOW TO MAKE DIFFERENT EMBODIMENTS OF THE INVENTION 

The invention relates to (I) polynucleotides and methods of use thereof, such as 

IA. Probes, Primers and Substrates; 

IB. Methods of Detection and Isolation; 
B.l. Hybridization; 

B.2. Methods of Mapping; 

B.3. Southern Blotting; 

B.4. Isolating cDNA from Related Organisms; 

B. 5. Isolating and/or Identifying Orthologous Genes 

IC. Methods of Inhibiting Gene Expression 

C. I. Antisense 

C.2. Ribozyme Constructs; 
C . 3 . Chimeraplasts; 
C.4 Co-Suppression; 
C.5. Transcriptional Silencing 
C.6. Other Methods to Inhibit Gene Expression 
ED. Methods of Functional Analysis; 

IE. Promoter Sequences and Their Use; 

IF. UTRs and/or Intron Sequences and Their Use; and 

IG. Coding Sequences and Their Use. 

The invention also relates to (II) polypeptides and proteins and methods of use thereof, 



IIA. Native Polypeptides and Proteins 

A.l Antibodies 

A. 2 In Vitro Applications 

IIB. Polypeptide Variants, Fragments and Fusions 

B. l Variants 
B.2 Fragments 
B.3 Fusions 

The invention also includes (EI) methods of modulating polypeptide production, such as 
ULA. Suppression 



such as 
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A.l Antisense 
A.2 Ribozymes 
A.3 Co-suppression 

A.4 Insertion of Sequences into the Gene to be Modulated 
A. 5 Promoter Modulation 

A. 6 Expression of Genes containing Dominant-Negative Mutations 
mB. Enhanced Expression 

B. l Insertion of an Exogenous Gene 
B.2 Promoter Modulation 

The invention further concerns (IV) gene constructs and vector construction, such as 
IVA. Coding Sequences 
IVB. Promoters 
IVC. Signal Peptides 

The invention still further relates to 
V. Transformation Techniques 

L Polynucleotides 

Exemplified SDFs of the invention represent fragments of the genome of corn, wheat, 
rice, soybean or Arabidopsis and/or represent mRNA expressed from that genome. The isolated 
nucleic acid of the invention also encompasses corresponding fragments of the genome and/or 
cDNA complement of other organisms as described in detail below. 

Polynucleotides of the invention can be isolated from polynucleotide libraries using 
primers comprising sequence similar to those described by or Reference, Sequence or 
polynucleotides that encode sequences Protein Group, and Protein Group Matrix tables or 
complements thereof. See, for example, the methods described in Sambrook et al. 9 supra. 

Alternatively, the polynucleotides of the invention can be produced by chemical 
synthesis. Such synthesis methods are described below. 

It is contemplated that the nucleotide sequences presented herein may contain some 
small percentage of errors. These errors may arise in the normal course of determination of 
nucleotide sequences. Sequence errors can be corrected by obtaining seeds deposited under 
the accession numbers cited above, propagating them, isolating genomic DNA or appropriate 
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mRNA from the resulting plants or seeds thereof, amplifying the relevant fragment of the 
genomic DNA or mRNA using primers having a sequence that flanks the erroneous 
sequence, and sequencing the amplification product. 

LA. Probes, Primers and Substrates 

SDFs of the invention can be applied to substrates for use in array applications such 
as, but not limited to, assays of global gene expression, for example under varying conditions 
of development, growth conditions. The arrays can also be used in diagnostic or forensic 
methods (WO95/35505, US 5,445,943 and US 5,410,270). 

Probes and primers of the instant invention will hybridize to a polynucleotide 
comprising a sequence in or encoded by those in the Reference, Sequence, Protein Group, 
and Protein Group Matrix tables or fragments or complement thereof. Though many 
different nucleotide sequences can encode an amino acid sequence, the sequences of the 
reference and Sequence table or sequences that encode polypeptides or fragments thereof 
described in Protein Group and Protein Group Matrix tables are generally preferred for 
encoding polypeptides of the invention. However, the sequence of the probes and/or primers 
of the instant invention need not be identical to those in the Reference and Sequence tables 
or the complements thereof. For example, some variation in probe or primer sequence and/or 
length can allow additional family members to be detected, as well as orthologous genes and 
more taxonomically distant related sequences. Similarly, probes and/or primers of the 
invention can include additional nucleotides that serve as a label for detecting the formed 
duplex or for subsequent cloning purposes. 

Probe length will vary depending on the application. For use as primers, probes are 
12-40 nucleotides, preferably 18-30 nucleotides long. For use in mapping, probes are 
preferably 50 to 500 nucleotides, preferably 100-250 nucleotides long. For Southern 
hybridizations, probes as long as several kilobases can be used as explained below. 

The probes and/or primers can be produced by synthetic procedures such as the 
triester method of Matteucci et al. Am. Chem, Soc. 103:3185( 1981); or according to Urdea 
et al. Proc. Natl Acad. 80:7461 (1981) or using commercially available automated 
oligonucleotide synthesizers. 

LB. Methods of Detection and Isolation 
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The polynucleotides of the invention can be utilized in a number of methods known to 
those skilled in the art as probes and/or primers to isolate and detect polynucleotides, 
including, without limitation: Southerns, Northerns, Branched DNA hybridization assays, 
polymerase chain reaction, and microarray assays, and variations thereof. Specific methods 
given by way of examples, and discussed below include: 

Hybridization 

Methods of Mapping 

Southern Blotting 

Isolating cDNA from Related Organisms 

Isolating and/or Identifying Orthologous Genes. 
Also, the nucleic acid molecules of the invention can used in other methods, such as high 
density oligonucleotide hybridizing assays, described, for example, in U.S. Pat. Nos. 
6,004,753; 5,945,306; 5,945,287; 5,945,308; 5,919,686; 5,919,661; 5,919,627; 5,874,248; 
5,871,973; 5,871,971; and 5,871,930; and PCT Pub. Nos. WO 9946380; WO 9933981; WO 
9933870; WO 9931252; WO 9915658; WO 9906572; WO 9858052; WO 9958672; and WO 
9810858. 

B.l. Hybridization 

The isolated SDFs of the Reference and Sequence tables or SDFs encoding 
polypeptides of the Protein Group and Protein Group Matrix tables or fragments thereof of 
the present invention can be used as probes and/or primers for detection and/or isolation of 
related polynucleotide sequences through hybridization. Hybridization of one nucleic acid to 
another constitutes a physical property that defines the subject SDF of the invention and the 
identified related sequences. Also, such hybridization imposes structural limitations on the 
pair. A good general discussion of the factors for determining hybridization conditions is 
provided by Sambrook et al. ("Molecular Cloning, a Laboratory Manual, 2nd ed., c. 1989 by 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY; see esp., chapters 1 1 and 12). 
Additional considerations and details of the physical chemistry of hybridization are provided by 
G.H. Keller and M.M. Manak "DNA Probes", 2 nd Ed. pp. 1-25, c. 1993 by Stockton Press, New 
York, NY. 

Depending on the stringency of the conditions under which these probes and/or primers 
are used, polynucleotides exhibiting a wide range of similarity to those in the Reference and 
Sequence or encoding polypeptides of the Protein Group and Protein Group Matrix tables or 
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fragments thereof can be detected or isolated. When the practitioner wishes to examine the 
result of membrane hybridizations under a variety of stringencies, an efficient way to do so is 
to perform the hybridization under a low stringency condition, then to wash the hybridization 
membrane under increasingly stringent conditions. 

When using SDFs to identify orthologous genes in other species, the practitioner will 
preferably adjust the amount of target DNA of each species so that, as nearly as is practical, 
the same number of genome equivalents are present for each species examined. This 
prevents faint signals from species having large genomes, and thus small numbers of genome 
equivalents per mass of DNA, from erroneously being interpreted as absence of the 
corresponding gene in the genome. 

The probes and/or primers of the instant invention can also be used to detect or isolate 
nucleotides that are "identical" to the probes or primers. Two nucleic acid sequences or 
polypeptides are said to be "identical" if the sequence of nucleotides or amino acid residues, 
respectively, in the two sequences is the same when aligned for maximum correspondence as 
described below. 

Isolated polynucleotides within the scope of the invention also include allelic variants of 
the specific sequences presented in the Reference, Sequence, Protein Group, and Protein Group 
Matrix tables. The probes and/or primers of the invention can also be used to detect and/or 
isolate polynucleotides exhibiting at least 80% sequence identity with the sequences of the 
reference, Sequence or encoding polypeptides of the Protein Group and Protein Group Matrix 
tables or fragments thereof. 

With respect to nucleotide sequences, degeneracy of the genetic code provides the 
possibility to substitute at least one base of the base sequence of a gene with a different base 
without causing the amino acid sequence of the polypeptide produced from the gene to be 
changed. Hence, the DNA of the present invention may also have any base sequence that has 
been changed from a sequence in the Reference, Sequence, Protein Group, and Protein Group 
Matrix tables by substitution in accordance with degeneracy of genetic code. References 
describing codon usage include: Carels et al. 9 J. Mol Evol 46: 45 (1998) and Fennoy et aL, 
Nucl Acids Res. 21(23) : 5294 (1993). 

B.2. Mapping 

The isolated SDF DNA of the invention can be used to create various types of genetic 
and physical maps of the genome of corn, Arabidopsis, soybean, rice, wheat, or other plants. 
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Some SDFs may be absolutely associated with particular phenotypic traits, allowing 
construction of gross genetic maps. While not all SDFs will immediately be associated with 
a phenotype, all SDFs can be used as probes for identifying polymorphisms associated with 
phenotypes of interest. Briefly, one method of mapping involves total DNA isolation from 
individuals. It is subsequently cleaved with one or more restriction enzymes, separated 
according to mass, transferred to a solid support, hybridized with SDF DNA and the pattern 
of fragments compared. Polymorphisms associated with a particular SDF are visualized as 
differences in the size of fragments produced between individual DNA samples after 
digestion with a particular restriction enzyme and hybridization with the SDF. After 
identification of polymorphic SDF sequences, linkage studies can be conducted. By using the 
individuals showing polymorphisms as parents in crossing programs, F2 progeny 
recombinants or recombinant inbreds, for example, are then analyzed. The order of DNA 
polymorphisms along the chromosomes can be determined based on the frequency with 
which they are inherited together versus independently. The closer two polymorphisms are 
together in a chromosome the higher the probability that they are inherited together. 
Integration of the relative positions of all the polymorphisms and associated marker SDFs can 
produce a genetic map of the species, where the distances between markers reflect the 
recombination frequencies in that chromosome segment. 

The use of recombinant inbred lines for such genetic mapping is described for 
Arabidopsis by Alonso-Blanco et al. {Methods in Molecular Biology, vol.82, "Arabidopsis 
Protocols", pp. 137-146, J.M. Martinez-Zapater and J. Salinas, eds., c. 1998 by Humana 
Press, Totowa, NJ) and for corn by Burr ("Mapping Genes with Recombinant Inbreds", pp. 
249-254. In Freeling, M. and V. Walbot (Ed.), The Maize Handbook, c. 1994 by Springer- 
Verlag New York, Inc.: New York, NY, USA; Berlin Germany; Burr et al. Genetics (1998) 
118: 519; Gardiner, J. et al., (1993) Genetics 134: 917). This procedure, however, is not 
limited to plants and can be used for other organisms (such as yeast) or for individual cells. 

The SDFs of the present invention can also be used for simple sequence repeat (SSR) 
mapping. Rice SSR mapping is described by Morgante et al. {The Plant Journal (1993) 3: 
165), Panaud et al. {Genome (1995) 38: 1 170); Senior et al. {Crop Science (1996) 36: 1676), 
Taramino et al. {Genome (1996) 39: 277) and Ahn et al. {Molecular and General Genetics 
(1993) 241: 483-90). SSR mapping can be achieved using various methods. In one instance, 
polymorphisms are identified when sequence specific probes contained within an SDF 
flanking an SSR are made and used in polymerase chain reaction (PCR) assays with template 
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DNA from two or more individuals of interest. Here, a change in the number of tandem 
repeats between the SSR-flanking sequences produces differently sized fragments (U.S. 
Patent 5,766,847). Alternatively, polymorphisms can be identified by using the PCR 
fragment produced from the SSR-flanking sequence specific primer reaction as a probe 
against Southern blots representing different individuals (U.H. Refseth et al., (1997) 
Electrophoresis 18: 1519). 

Genetic and physical maps of crop species have many uses. For example, these maps 
can be used to devise positional cloning strategies for isolating novel genes from the mapped 
crop species. In addition, because the genomes of closely related species are largely syntenic 
(that is, they display the same ordering of genes within the genome), these maps can be used 
to isolate novel alleles from relatives of crop species by positional cloning strategies. 

The various types of maps discussed above can be used with the SDFs of the 
invention to identify Quantitative Trait Loci (QTLs). Many important crop traits, such as the 
solids content of tomatoes, are quantitative traits and result from the combined interactions of 
several genes. These genes reside at different loci in the genome, oftentimes on different 
chromosomes, and generally exhibit multiple alleles at each locus. The SDFs of the 
invention can be used to identify QTLs and isolate specific alleles as described by de Vicente 
and Tanksley {Genetics 134:585 (1993)). In addition to isolating QTL alleles in present crop 
species, the SDFs of the invention can also be used to isolate alleles from the corresponding 
QTL of wild relatives. Transgenic plants having various combinations of QTL alleles can 
then be created and the effects of the combinations measured. Once a desired allele 
combination has been identified, crop improvement can be accomplished either through 
biotechnological means or by directed conventional breeding programs (for review see 
Tanksley and McCouch, Science 277:1063 (1997)). 

In another embodiment, the SDFs can be used to help create physical maps of the 
genome of corn, Arabidopsis and related species. Where SDFs have been ordered on a 
genetic map, as described above, they can be used as probes to discover which clones in 
large libraries of plant DNA fragments in YACs, BACs, etc. contain the same SDF or similar 
sequences, thereby facilitating the assignment of the large DNA fragments to chromosomal 
positions. Subsequently, the large BACs, YACs, etc. can be ordered unambiguously by more 
detailed studies of their sequence composition (e.g. Marra et al. (1997) Genomic Research 
7:1072-1084) and by using their end or other sequences to find the identical sequences in 
other cloned DNA fragments. The overlapping of DNA sequences in this way allows large 
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contigs of plant sequences to be built that, when sufficiently extended, provide a complete 
physical map of a chromosome. Sometimes the SDFs themselves will provide the means of 
joining cloned sequences into a contig. 

The patent publication WO95/35505 and U.S. Patents 5,445,943 and 5,410,270 
describe scanning multiple alleles of a plurality of loci using hybridization to arrays of 
oligonucleotides. These techniques are useful for each of the types of mapping discussed 
above. 

Following the procedures described above and using a plurality of the SDFs of the 
present invention, any individual can be genotyped. These individual genotypes can be used 
for the identification of particular cultivars, varieties, lines, ecotypes and genetically modified 
plants or can serve as tools for subsequent genetic studies involving multiple phenotypic 
traits. 

B.3 Southern Blot Hybridization 

The sequences from Reference and Sequence and those encoding polypeptides of 
Protein Group and Protein Group Matrix tables or fragments thereof can be used as probes 
for various hybridization techniques. These techniques are useful for detecting target 
polynucleotides in a sample or for determining whether transgenic plants, seeds or host cells 
harbor a gene or sequence of interest and thus might be expected to exhibit a particular trait 
or phenotype. 

In addition, the SDFs from the invention can be used to isolate additional members of 
gene families from the same or different species and/or orthologous genes from the same or 
different species. This is accomplished by hybridizing an SDF to, for example, a Southern 
blot containing the appropriate genomic DNA or cDNA. Given the resulting hybridization 
data, one of ordinary skill in the art could distinguish and isolate the correct DNA fragments 
by size, restriction sites, sequence and stated hybridization conditions from a gel or from a 
library. 

Identification and isolation of orthologous genes from closely related species and 
alleles within a species is particularly desirable because of their potential for crop 
improvement. Many important crop traits, such as the solid content of tomatoes, result from 
the combined interactions of the products of several genes residing at different loci in the 
genome. Generally, alleles at each of these loci can make quantitative differences to the trait. 
By identifying and isolating numerous alleles for each locus from within or different species, 
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transgenic plants with various combinations of alleles can be created and the effects of the 
combinations measured. Once a more favorable allele combination has been identified, crop 
improvement can be accomplished either through biotechnological means or by directed 
conventional breeding programs (Tanksley et al. Science 277 :1063(1997)). 

The results from hybridizations of the SDFs of the invention to, for example, 
Southern blots containing DNA from another species can also be used to generate restriction 
fragment maps for the corresponding genomic regions. These maps provide additional 
information about the relative positions of restriction sites within fragments, further 
distinguishing mapped DNA from the remainder of the genome. 

Physical maps can be made by digesting genomic DNA with different combinations 
of restriction enzymes. 

Probes for Southern blotting to distinguish individual restriction fragments can range in 
size from 15 to 20 nucleotides to several thousand nucleotides. More preferably, the probe is 
100 to 1,000 nucleotides long for identifying members of a gene family when it is found that 
repetitive sequences would complicate the hybridization. For identifying an entire 
corresponding gene in another species, the probe is more preferably the length of the gene, 
typically 2,000 to 10,000 nucleotides, but probes 50-1,000 nucleotides long might be used. 
Some genes, however, might require probes up to 1,500 nucleotides long or overlapping probes 
constituting the full-length sequence to span their lengths. 

Also, while it is preferred that the probe be homogeneous with respect to its sequence, 
it is not necessary. For example, as described below, a probe representing members of a gene 
family having diverse sequences can be generated using PCR to amplify genomic DNA or 
RNA templates using primers derived from SDFs that include sequences that define the gene 
family. 

For identifying corresponding genes in another species, the next most preferable 
probe is a cDNA spanning the entire coding sequence, which allows all of the mRNA-coding 
fragment of the gene to be identified. Probes for Southern blotting can easily be generated 
from SDFs by making primers having the sequence at the ends of the SDF and using corn or 
Arabidopsis genomic DNA as a template. In instances where the SDF includes sequence 
conserved among species, primers including the conserved sequence can be used for PCR 
with genomic DNA from a species of interest to obtain a probe. 

Similarly, if the SDF includes a domain of interest, that fragment of the SDF can be 
used to make primers and, with appropriate template DNA, used to make a probe to identify 
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genes containing the domain. Alternatively, the PCR products can be resolved, for example 
by gel electrophoresis, and cloned and/or sequenced. Using Southern hybridization, the 
variants of the domain among members of a gene family, both within and across species, can 
be examined. 



B.4.1 Isolating DNA from Related Organisms 

The SDFs of the invention can be used to isolate the corresponding DNA from other 
organisms. Either cDNA or genomic DNA can be isolated. For isolating genomic DNA, a 
lambda, cosmid, BAC or YAC, or other large insert genomic library from the plant of interest 
can be constructed using standard molecular biology techniques as described in detail by 
Sambrook et al. 1989 (Molecular Cloning: A Laboratory Manual, 2 nd ed. Cold Spring Harbor 
Laboratory Press, New York) and by Ausubel et al. 1992 (Current Protocols in Molecular 
Biology, Greene Publishing, New York). 

To screen a phage library, for example, recombinant lambda clones are plated out on 
appropriate bacterial medium using an appropriate E. coli host strain. The resulting plaques 
are lifted from the plates using nylon or nitrocellulose filters. The plaque lifts are processed 
through denaturation, neutralization, and washing treatments following the standard protocols 
outlined by Ausubel et al. (1992). The plaque lifts are hybridized to either radioactively 
labeled or non-radioactively labeled SDF DNA at room temperature for about 16 hours, 
usually in the presence of 50% formamide and 5X SSC (sodium chloride and sodium citrate) 
buffer and blocking reagents. The plaque lifts are then washed at 42°C with 1% Sodium 
Dodecyl Sulfate (SDS) and at a particular concentration of SSC. The SSC concentration used 
is dependent upon the stringency at which hybridization occurred in the initial Southern blot 
analysis performed. For example, if a fragment hybridized under medium stringency (e.g., 
Tm - 20°C), then this condition is maintained or preferably adjusted to a less stringent 
condition (e.g., Tm-30°C) to wash the plaque lifts. Positive clones show detectable 
hybridization e.g., by exposure to X-ray films or chromogen formation. The positive clones 
are then subsequently isolated for purification using the same general protocol outlined 
above. Once the clone is purified, restriction analysis can be conducted to narrow the region 
corresponding to the gene of interest. The restriction analysis and succeeding subcloning 
steps can be done using procedures described by, for example Sambrook et al. (1989) cited 
above. 
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The procedures outlined for the lambda library are essentially similar to those used for 
YAC library screening, except that the YAC clones are harbored in bacterial colonies. The 
YAC clones are plated out at reasonable density on nitrocellulose or nylon filters supported 
by appropriate bacterial medium in petri plates. Following the growth of the bacterial clones, 
the filters are processed through the denaturation, neutralization, and washing steps following 
the procedures of Ausubel et al. 1992. The same hybridization procedures for lambda library 
screening are followed. 

To isolate cDNA, similar procedures using appropriately modified vectors are 
employed. For instance, the library can be constructed in a lambda vector appropriate for 
cloning cDNA such as Xgtl 1. Alternatively, the cDNA library can be made in a plasmid 
vector. cDNA for cloning can be prepared by any of the methods known in the art, but is 
preferably prepared as described above. Preferably, a cDNA library will include a high 
proportion of full-length clones. 

B. 5. Isolating and/or Identifying Orthologous Genes 
Probes and primers of the invention can be used to identify and/or isolate 
polynucleotides related to those in the Reference, Sequence, Protein Group, and Protein Group 
Matrix tables. Related polynucleotides are those that are native to other plant organisms and 
exhibit either similar sequence or encode polypeptides with similar biological activity. One 
specific example is an orthologous gene. Orthologous genes have the same functional activity. 
As such, orthologous genes may be distinguished from homologous genes. The percentage of 
identity is a function of evolutionary separation and, in closely related species, the percentage of 
identity can be 98 to 100%. The amino acid sequence of a protein encoded by an orthologous 
gene can be less than 75% identical, but tends to be at least75% or at least 80% identical, more 
preferably at least 90%, most preferably at least 95% identical to the amino acid sequence of the 
reference protein. 

To find orthologous genes, the probes are hybridized to nucleic acids from a species of 
interest under low stringency conditions, preferably one where sequences containing as much as 
40-45% mismatches will be able to hybridize. This condition is established by T m - 40°C to Tm 
- 48°C (see below). Blots are then washed under conditions of increasing stringency. It is 
preferable that the wash stringency be such that sequences that are 85 to 100% identical will 
hybridize. More preferably, sequences 90 to 100% identical will hybridize and most preferably 
only sequences greater than 95% identical will hybridize. One of ordinary skill in the art will 
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recognize that, due to degeneracy in the genetic code, amino acid sequences that are identical 
can be encoded by DNA sequences as little as 67% identical or less. Thus, it is preferable, for 
example, to make an overlapping series of shorter probes, on the order of 24 to 45 nucleotides, 
and individually hybridize them to the same arrayed library to avoid the problem of degeneracy 
introducing large numbers of mismatches. 

As evolutionary divergence increases, genome sequences also tend to diverge. Thus, 
one of skill will recognize that searches for orthologous genes between more divergent 
species will require the use of lower stringency conditions compared to searches between 
closely related species. Also, degeneracy of the genetic code is more of a problem for 
searches in the genome of a species more distant evolutionarily from the species that is the 
source of the SDF probe sequences. 

Therefore the method described in Bouckaert et al., U.S. Ser. No. 60/121,700 Atty. 
Dkt. No. 2750-1 17P, Client Dkt. No. 00010.001, filed February 25, 1999, hereby 
incorporated in its entirety by reference, can be applied to the SDFs of the present invention 
to isolate related genes from plant species which do not hybridize to the corn Arabidopsis, 
soybean, rice, wheat, and other plant sequences of the reference, Sequence, Protein Group, 
and Protein Group Matrix tables. 

Identification of the relationship of nucleotide or amino acid sequences among plant 
species can be done by comparing the nucleotide or amino acid sequences of SDFs of the 
present application with nucleotide or amino acid sequences of other SDFs such as those 
present in applications listed in the table below: 

The SDFs of the invention can also be used as probes to search for genes that are 
related to the SDF within a species. Such related genes are typically considered to be 
members of a gene family. In such a case, the sequence similarity will often be concentrated 
into one or a few fragments of the sequence. The fragments of similar sequence that define 
the gene family typically encode a fragment of a protein or RNA that has an enzymatic or 
structural function. The percentage of identity in the amino acid sequence of the domain that 
defines the gene family is preferably at least 70%, more preferably 80 to 95%, most 
preferably 85 to 99%. To search for members of a gene family within a species, a low 
stringency hybridization is usually performed, but this will depend upon the size, distribution 
and degree of sequence divergence of domains that define the gene family. SDFs 
encompassing regulatory regions can be used to identify coordinately expressed genes by 
using the regulatory region sequence of the SDF as a probe. 
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In the instances where the SDFs are identified as being expressed from genes that 
confer a particular phenotype, then the SDFs can also be used as probes to assay plants of 
different species for those phenotypes. 



5 I.C. Methods to Inhibit Gene Expression 

The nucleic acid molecules of the present invention can be used to inhibit gene 
transcription and/or translation. Example of such methods include, without limitation: 
Antisense Constructs; 
Ribozyme Constructs; 
10 Chimeraplast Constructs; 

Co-Suppression; 
Transcriptional Silencing; and 
Other Methods of Gene Expression. 

~f"l 

15 C.l Antisense 

£ In some instances it is desirable to suppress expression of an endogenous or 

Rj exogenous gene. A well-known instance is the FLAVOR-SAVOR™ tomato, in which the 
tl gene encoding ACC synthase is inactivated by an antisense approach, thus delaying softening 
5 _ of the fruit after ripening. See for example, U.S. Patent No. 5,859,330; U.S. Patent No. 
|0 5,723,766; Oeller, et al, Science, 254:437-439(1991); and Hamilton et al, Nature, 346:284- 
% 287 (1990). Also, timing of flowering can be controlled by suppression of the FLOWERING 
O LOCUS C (FLQ; high levels of this transcript are associated with late flowering, while 

absence of FLC is associated with early flowering (S.D. Michaels et al., Plant Cell JJ.:949 
(1999). Also, the transition of apical meristem from production of leaves with associated 
25 shoots to flowering is regulated by TERMINAL FLOWER1, APETALA1 and LEAFY, Thus, 
when it is desired to induce a transition from shoot production to flowering, it is desirable to 
suppress TFL1 expression (SJ. Liljegren, Plant Cell 11:1007 (1999)). As another instance, 
arrested ovule development and female sterility result from suppression of the ethylene 
forming enzyme but can be reversed by application of ethylene (D. De Martinis et al., Plant 
30 Cell U : 1061 (1999)). The ability to manipulate female fertility of plants is useful in 
increasing fruit production and creating hybrids. 

In the case of polynucleotides used to inhibit expression of an endogenous gene, the 
introduced sequence need not be perfectly identical to a sequence of the target endogenous gene. 
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The introduced polynucleotide sequence will typically be at least substantially identical to the 
target endogenous sequence. 

Some polynucleotide SDFs in the Reference, Sequence, Protein Group, and Protein 
Group Matrix tables represent sequences that are expressed in corn,wheat, rice, soybean 
Arabidopsis and/or other plants. Thus the invention includes using these sequences to 
generate antisense constructs to inhibit translation and/or degradation of transcripts of said 
SDFs, typically in a plant cell. 

To accomplish this, a polynucleotide segment from the desired gene that can hybridize to 
the mRNA expressed from the desired gene (the "antisense segment") is operably linked to a 
promoter such that the antisense strand of RNA will be transcribed when the construct is present 
in a host cell. A regulated promoter can be used in the construct to control transcription of the 
antisense segment so that transcription occurs only under desired circumstances. 

The antisense segment to be introduced generally will be substantially identical to at 
least a fragment of the endogenous gene or genes to be repressed. The sequence, however, need 
not be perfectly identical to inhibit expression. Further, the antisense product may hybridize to 
the untranslated region instead of or in addition to the coding sequence of the gene. The vectors 
of the present invention can be designed such that the inhibitory effect applies to other proteins 
within a family of genes exhibiting homology or substantial homology to the target gene. 

For antisense suppression, the introduced antisense segment sequence also need not be 
full length relative to either the primary transcription product or the fully processed mRNA. 
Generally, a higher percentage of sequence identity can be used to compensate for the use of a 
shorter sequence. Furthermore, the introduced sequence need not have the same intron or exon 
pattern, and homology of non-coding segments may be equally effective. Normally, a sequence 
of between about 30 or 40 nucleotides and the full length of the transcript canbe used, though a 
sequence of at least about 100 nucleotides is preferred, a sequence of at least about 200 
nucleotides is more preferred, and a sequence of at least about 500 nucleotides is especially 
preferred. 




C.2. Ribozymes 

It is also contemplated that gene constructs representing ribozymes and based on the 
SDFs in the Reference and Sequence tables or those encoding polypeptides of the Protein 
Group and Protein Group Matrix tables and fragment thereof are an object of the invention. 
Ribozymes can also be used to inhibit expression of genes by suppressing the translation of the 
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mRNA into a polypeptide. It is possible to design ribozymes that specifically pair with virtually 
any target RNA and cleave the phosphodiester backbone at a specific location, thereby 
functionally inactivating the target RNA. In carrying out this cleavage, the ribozyme is not itself 
altered, and is thus capable of recycling and cleaving other molecules, making it a true enzyme. 
The inclusion of ribozyme sequences within antisense RNAs confers RNA-cleaving activity 
upon them, thereby increasing the activity of the constructs. 

A number of classes of ribozymes have been identified. One class of ribozymes is 
derived from a number of small circular RNAs, which are capable of self-cleavage and 
replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper virus 
(satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the satellite 
RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco mottle virus, 
solanum nodiflorum mottle virus and subterranean clover mottle virus. The design and use of 
target RNA-specific ribozymes is described in Haseloff et al. Nature, 334:585 (1988). 

Like the antisense constructs above, the ribozyme sequence fragment necessary for 
pairing need not be identical to the target nucleotides to be cleaved, nor identical to the 
sequences in the Reference and Sequence tables or those encoding polypeptide of the Protein 
Group and Protein Group Matrix tables or fragments thereof. Ribozymes may be constructed 
by combining the ribozyme sequence and some fragment of the target gene which would 
allow recognition of the target gene mRNA by the resulting ribozyme molecule. Generally, 
the sequence in the ribozyme capable of binding to the target sequence exhibits a percentage 
of sequence identity with at least 80%, preferably with at least 85%, more preferably with at 
least 90% and most preferably with at least 95%, even more preferably, with at least 96%, 97%, 
98% or 99% sequence identity to some fragment of a sequence in the Reference, Sequence, 
Protein Group, and Protein Group Matrix tables or the complement thereof. The ribozyme 
can be equally effective in inhibiting mRNA translation by cleaving either in the untranslated 
or coding regions. Generally, a higher percentage of sequence identity can be used to 
compensate for the use of a shorter sequence. Furthermore, the introduced sequence need not 
have the same intron or exon pattern, and homology of non-coding segments may be equally 
effective. 

C.3. Chimeraplasts 

The SDFs of the invention, such as those described by Reference, Sequence, Protein 
Group, and Protein Group Matrix tables, can also be used to construct chimeraplasts that can 
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be introduced into a cell to produce at least one specific nucleotide change in a sequence 
corresponding to the SDF of the invention. A chimeraplast is an oligonucleotide comprising 
DNA and/or RNA that specifically hybridizes to a target region in a manner which creates a 
mismatched base-pair. This mismatched base-pair signals the cell's repair enzyme machinery 
which acts on the mismatched region resulting in the replacement, insertion or deletion of 
designated nucleotide(s). The altered sequence is then expressed by the cell's normal cellular 
mechanisms. Chimeraplasts can be designed to repair mutant genes, modify genes, introduce 
site-specific mutations, and/or act to interrupt or alter normal gene function (US Pat. Nos. 
6,010,907 and 6,004,804; and PCT Pub. No. W099/58723 and WO99/07865). 

C.4. Sense Suppression 

The SDFs of the reference, Sequence, Protein Group, and Protein Group Matrix tables 
of the present invention are also useful to modulate gene expression by sense suppression. 
Sense suppression represents another method of gene suppression by introducing at least one 
exogenous copy or fragment of the endogenous sequence to be suppressed. 

Introduction of expression cassettes in which a nucleic acid is configured in the sense 
orientation with respect to the promoter into the chromosome of a plant or by a self-replicating 
virus has been shown to be an effective means by which to induce degradation of mRNAs of 
target genes. For an example of the use of this method to modulate expression of endogenous 
genes see, Napoli et al., The Plant Cell 2:279 (1990), and U.S. Patents Nos. 5,034,323, 
5,231,020, and 5,283,184. Inhibition of expression may require some transcription of the 
introduced sequence. 

For sense suppression, the introduced sequence generally will be substantially identical 
to the endogenous sequence intended to be inactivated. The minimal percentage of sequence 
identity will typically be greater than about 65%, but a higher percentage of sequence identity 
might exert a more effective reduction in the level of normal gene products. Sequence identity 
of more than about 80% is preferred, though about 95% to absolute identity would be most 
preferred. As with antisense regulation, the effect would likely apply to any other proteins 
within a similar family of genes exhibiting homology or substantial homology to the suppressing 
sequence. 

C.5. Transcriptional Silencing 
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The nucleic acid sequences of the invention, including the SDFs of the reference, 
Sequence, Protein Group, and Protein Group Matrix tables, and fragments thereof, contain 
sequences that can be inserted into the genome of an organism resulting in transcriptional 
silencing. Such regulatory sequences need not be operatively linked to coding sequences to 
modulate transcription of a gene. Specifically, a promoter sequence without any other 
element of a gene can be introduced into a genome to transcriptionally silence an endogenous 
gene (see, for example, Vaucheret, H et al. (1998) The Plant Journal 16: 651-659). As another 
example, triple helices can be formed using oligonucleotides based on sequences from 
Reference, Sequence, Protein Group, and Protein Group Matrix tables, fragments thereof, and 
substantially similar sequence thereto. The oligonucleotide can be delivered to the host cell 
and can bind to the promoter in the genome to form a triple helix and prevent transcription. 
An oligonucleotide of interest is one that can bind to the promoter and block binding of a 
transcription factor to the promoter. In such a case, the oligonucleotide can be 
complementary to the sequences of the promoter that interact with transcription binding 
factors. 

C.6. Other Methods to Inhibit Gene Expression 

Yet another means of suppressing gene expression is to insert a polynucleotide into the 
gene of interest to disrupt transcription or translation of the gene. 

Low frequency homologous recombination can be used to target a polynucleotide insert 
to a gene by flanking the polynucleotide insert with sequences that are substantially similar to 
the gene to be disrupted. Sequences from Reference, Sequence, Protein Group, and Protein 
Group Matrix tables, fragments thereof, and substantially similar sequence thereto can be used 
for homologous recombination. 

In addition, random insertion of polynucleotides into a host cell genome can also be 
used to disrupt the gene of interest. Azpiroz-Leehan et al., Trends in Genetics .13:152 (1997). 
In this method, screening for clones from a library containing random insertions is preferred 
to identifying those that have polynucleotides inserted into the gene of interest. Such 
screening can be performed using probes and/or primers described above based on sequences 
from Reference, Sequence, Protein Group, and Protein Group Matrix tables, fragments 
thereof, and substantially similar sequence thereto. The screening can also be performed by 
selecting clones or Ri plants having a desired phenotype. 
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I.D. Methods of Functional Analysis 

The constructs described in the methods under I.C. above can be used to determine 
the function of the polypeptide encoded by the gene that is targeted by the constructs. 

Down-regulating the transcription and translation of the targeted gene in the host cell 
or organisms, such as a plant, may produce phenotypic changes as compared to a wild-type 
cell or organism. In addition, in vitro assays can be used to determine if any biological 
activity, such as calcium flux, DNA transcription, nucleotide incorporation, etc., are being 
modulated by the down-regulation of the targeted gene. 

Coordinated regulation of sets of genes, e.g., those contributing to a desired polygenic 
trait, is sometimes necessary to obtain a desired phenotype. SDFs of the invention 
representing transcription activation and DNA binding domains can be assembled into hybrid 
transcriptional activators. These hybrid transcriptional activators can be used with their 
corresponding DNA elements (i.e., those bound by the DNA-binding SDFs) to effect 
coordinated expression of desired genes (JJ. Schwarz et al., Mol Cell Biol Y2/266 (1992), 
A. Martinez et al., Mol Gen. Genet. 261:546 (1999)). 

The SDFs of the invention can also be used in the two-hybrid genetic systems to 
identify networks of protein-protein interactions (L. McAlister-Henn et al., Methods 1^9:330 
(1999), J.C. Hu et al., Methods 20:80 (2000), M. Golovkin et al., J. Biol Chem. 274:36428 
(1999), K. Ichimura et al., Biochem. Biophys. Res. Comm. 253:532 (1998)). The SDFs of the 
invention can also be used in various expression display methods to identify important 
protein-DNA interactions (e.g. B. Luo et al., J. Mol. Biol. 266:479 (1997)). 

I.E. Promoters 

The SDFs of the invention are also useful as structural or regulatory sequences in a 
construct for modulating the expression of the corresponding gene in a plant or other organism, 
e.g. a symbiotic bacterium. For example, promoter sequences associated to SDFs of the 
reference, Sequence, Protein Group, and Protein Group Matrix tables of the present invention 
can be useful in directing expression of coding sequences either as constitutive promoters or to 
direct expression in particular cell types, tissues, or organs or in response to environmental 
stimuli. 

With respect to the SDFs of the present invention a promoter is likely to be a relatively 
small portion of a genomic DNA (gDNA) sequence located in the first 2000 nucleotides 
upstream from an initial exon identified in a gDNA sequence or initial "ATG" or methionine 
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codon or translational start site in a corresponding cDNA sequence. Such promoters are more 
likely to be found in the first 1000 nucleotides upstream of an initial ATG or methionine codon 
or translational start site of a cDNA sequence corresponding to a gDNA sequence. In particular, 
the promoter is usually located upstream of the transcription start site. The fragments of a 
particular gDNA sequence that function as elements of a promoter in a plant cell will preferably 
be found to hybridize to gDNA sequences presented and described in the Reference table at 
medium or high stringency, relevant to the length of the probe and its base composition. 

Promoters are generally modular in nature. Promoters can consist of a basal promoter 
that functions as a site for assembly of a transcription complex comprising an RNA polymerase, 
for example RNA polymerase II. A typical transcription complex will include additional factors 
such as TFnB, TFnD, and TFnE. Of these, TF n D appears to be the only one to bind DNA 
directly. The promoter might also contain one or more enhancers and/or suppressors that 
function as binding sites for additional transcription factors that have the function of modulating 
the level of transcription with respect to tissue specificity and of transcriptional responses to 
particular environmental or nutritional factors, and the like. 

Short DNA sequences representing binding sites for proteins can be separated from each 
other by intervening sequences of varying length. For example, within a particular functional 
module, protein binding sites may be constituted by regions of 5 to 60, preferably 10 to 30, more 
preferably 10 to 20 nucleotides. Within such binding sites, there are typically 2 to 6 nucleotides 
that specifically contact amino acids of the nucleic acid binding protein. The protein binding 
sites are usually separated from each other by 10 to several hundred nucleotides, typically by 15 
to 150 nucleotides, often by 20 to 50 nucleotides. DNA binding sites in promoter elements often 
display dyad symmetry in their sequence. Often elements binding several different proteins, 
and/or a plurality of sites that bind the same protein, will be combined in a region of 50 to 1,000 
basepairs. 

Elements that have transcription regulatory function can be isolated from their 
corresponding endogenous gene, or the desired sequence can be synthesized, and recombined in 
constructs to direct expression of a coding region of a gene in a desired tissue-specific, temporal- 
specific or other desired manner of inducibility or suppression. When hybridizations are 
performed to identify or isolate elements of a promoter by hybridization to the long sequences 
presented in the Reference tables, conditions are adjusted to account for the above-described 
nature of promoters. For example short probes, constituting the element sought, are preferably 
used under low temperature and/or high salt conditions. When long probes, which might 
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include several promoter elements are used, low to medium stringency conditions are preferred 
when hybridizing to promoters across species. 

If a nucleotide sequence of an SDF, or part of the SDF, functions as a promoter or 
fragment of a promoter, then nucleotide substitutions, insertions or deletions that do not 
substantially affect the binding of relevant DNA binding proteins would be considered 
equivalent to the exemplified nucleotide sequence. It is envisioned that there are instances 
where it is desirable to decrease the binding of relevant DNA binding proteins to silence or 
down-regulate a promoter, or conversely to increase the binding of relevant DNA binding 
proteins to enhance or up-regulate a promoter and vice versa. In such instances, 
polynucleotides representing changes to the nucleotide sequence of the DNA-protein contact 
region by insertion of additional nucleotides, changes to identity of relevant nucleotides, 
including use of chemically-modified bases, or deletion of one or more nucleotides are 
considered encompassed by the present invention. In addition, fragments of the promoter 
sequences described by Reference tables and variants thereof can be fused with other 
promoters or fragments to facilitate transcription and/or transcription in specific type of cells 
or under specific conditions. 

Promoter function can be assayed by methods known in the art, preferably by 
measuring activity of a reporter gene operatively linked to the sequence being tested for 
promoter function. Examples of reporter genes include those encoding luciferase, green 
fluorescent protein, GUS, neo, cat and bar. 




LF. UTRs and Junctions 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon junctions are 
also within the scope of the invention. UTR sequences include introns and 5' or 3' untranslated 
regions ( 5' UTRs or 3 5 UTRs). Fragments of the sequences shown in the Reference and 
Sequence tables can comprise UTRs and intron/exon junctions. 

These fragments of SDFs, especially UTRs, can have regulatory functions related to, for 
example, translation rate and mRNA stability. Thus, these fragments of SDFs can be isolated 
for use as elements of gene constructs for regulated production of polynucleotides encoding 
desired polypeptides. 

Introns of genomic DNA segments might also have regulatory functions. Sometimes 
regulatory elements, especially transcription enhancer or suppressor elements, are found 
within introns. Also, elements related to stability of heteronuclear RNA and efficiency of 
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splicing and of transport to the cytoplasm for translation can be found in intron elements. 
Thus, these segments can also find use as elements of expression vectors intended for use to 
transform plants. 

Just as with promoters UTR sequences and intron/exon junctions can vary from those 
5 shown in the Reference and Sequence tables. Such changes from those sequences preferably 
will not affect the regulatory activity of the UTRs or intron/exon junction sequences on 
expression, transcription, or translation unless selected to do so. However, in some instances, 
down- or up-regulation of such activity may be desired to modulate traits or phenotypic or in 
vitro activity. 

10 

I.G. Coding Sequences 

Isolated polynucleotides of the invention can include coding sequences that encode 
polypeptides comprising an amino acid sequence encoded by sequences described in the 
C Reference and Sequence tables or an amino acid sequence presented in the Reference, 
=K> Sequence, Protein Group, and Protein Group Matrix tables. 

J; A nucleotide sequence encodes a polypeptide if a cell (or a cell free in vitro system) 

fU expressing that nucleotide sequence produces a polypeptide having the recited amino acid 
#rj sequence when the nucleotide sequence is transcribed and the primary transcript is 
L subsequently processed and translated by a host cell (or a cell free in vitro system) harboring 

SO the nucleic acid. Thus, an isolated nucleic acid that encodes a particular amino acid sequence 

m 

V can be a genomic sequence comprising exons and introns or a cDNA sequence that represents 
^ the product of splicing thereof. An isolated nucleic acid encoding an amino acid sequence 

also encompasses heteronuclear RNA, which contains sequences that are spliced out during 

expression, and mRNA, which lacks those sequences. 
25 Coding sequences can be constructed using chemical synthesis techniques or by 

isolating coding sequences or by modifying such synthesized or isolated coding sequences as 

described above. 

In addition to coding sequences encoding the polypeptide sequences of the reference, 
Sequence, Protein Group, and Protein Group Matrix tables, which are native to corn, 
30 Arabidopsis, soybean, rice, wheat, and other plants, the isolated polynucleotides can be 

polynucleotides that encode variants, fragments, and fusions of those native proteins. Such 
polypeptides are described below in part II. 
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In variant polynucleotides generally, the number of substitutions, deletions or insertions 
is preferably less than 20%, more preferably less than 15%; even more preferably less than 10%, 
5%, 3% or 1% of the number of nucleotides comprising a particularly exemplified sequence. It 
is generally expected that non-degenerate nucleotide sequence changes that result in 1 to 10, 
more preferably 1 to 5 and most preferably 1 to 3 amino acid insertions, deletions or 
substitutions will not greatly affect the function of an encoded polypeptide. The most preferred 
embodiments are those wherein 1 to 20, preferably 1 to 10, most preferably 1 to 5 nucleotides 
are added to, or deleted from and/or substituted in the sequences specifically disclosed in the 
Reference and Sequence tables or polynucleotides that encode polypeptides of the Protein 
Group, and Protein Group Matrix tables or fragments thereof. 

Insertions or deletions in polynucleotides intended to be used for encoding a 
polypeptide preferably preserve the reading frame. This consideration is not so important in 
instances when the polynucleotide is intended to be used as a hybridization probe. 

II Polypeptides and Proteins 

IIA. Native polypeptides and proteins 

Polypeptides within the scope of the invention include both native proteins as well as 
variants, fragments, and fusions thereof. Polypeptides of the invention are those encoded by 
any of the six reading frames of sequences shown in the Reference and Sequence tables, 
preferably encoded by the three frames reading in the 5' to 3' direction of the sequences as 
shown. 

Native polypeptides include the proteins encoded by the sequences shown in the 
Reference and Sequence tables. Such native polypeptides include those encoded by allelic 
variants. 

Polypeptide and protein variants will exhibit at least 75% sequence identity to those 
native polypeptides of the Reference and Sequence tables. More preferably, the polypeptide 
variants will exhibit at least 85% sequence identity; even more preferably, at least 90% sequence 
identity; more preferably at least 95%, 96%, 97%, 98%, or 99% sequence identity. Fragments 
of polypeptide or fragments of polypeptides will exhibit similar percentages of sequence 
identity to the relevant fragments of the native polypeptide. Fusions will exhibit a similar 
percentage of sequence identity in that fragment of the fusion represented by the variant of the 
native peptide. 
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Polypeptide and protein variants of the invention will exhibit at least 75% sequence 
identity to those motifs or consensus sequences of the Protein Group and Protein Group Matrix 
tables. More preferably, the polypeptide variants will exhibit at least 85% sequence identity; 
even more preferably, at least 90% sequence identity; more preferably at least 95%, 96%, 97%, 
98%, or 99% sequence identity. Fragments of polypeptide or fragments of polypeptides will 
exhibit similar percentages of sequence identity to the relevant fragments of the native 
polypeptide that are indicated in the Protein Group table. Fusions will exhibit a similar 
percentage of sequence identity in that fragment of the fusion represented by the variant of the 
native peptide. 

Furthermore, polypeptide variants will exhibit at least one of the functional properties of 
the native protein. Such properties include, without limitation, protein interaction, DNA 
interaction, biological activity, immunological activity, receptor binding, signal transduction, 
transcription activity, growth factor activity, secondary structure, three-dimensional structure, 
etc. As to properties related to in vitro or in vivo activities, the variants preferably exhibit at least 
60% of the activity of the native protein; more preferably at least 70%, even more preferably at 
least 80%, 85%, 90% or 95% of at least one activity of the native protein. 

One type of variant of native polypeptides comprises amino acid substitutions, deletions 
and/or insertions. Conservative substitutions are preferred to maintain the function or activity of 
the polypeptide. 

Within the scope of percentage of sequence identity described above, a polypeptide of 
the invention may have additional individual amino acids or amino acid sequences inserted 
into the polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends 
thereof. Likewise, some of the amino acids or amino acid sequences may be deleted from the 
polypeptide. 

A.l Antibodies 

Isolated polypeptides can be utilized to produce antibodies. Polypeptides of the invention 
can generally be used, for example, as antigens for raising antibodies by known techniques. The 
resulting antibodies are. useful as reagents for determining the distribution of the antigen protein 
within the tissues of a plant or within a cell of a plant. The antibodies are also useful for 
examining the production level of proteins in various tissues, for example in a wild-type plant or 
following genetic manipulation of a plant, by methods such as Western blotting. 
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Antibodies of the present invention, both polyclonal and monoclonal, may be prepared 
by conventional methods. In general, the polypeptides of the invention are first used to 
immunize a suitable animal, such as a mouse, rat, rabbit, or goat. Rabbits and goats are 
preferred for the preparation of polyclonal sera due to the volume of serum obtainable, and the 
availability of labeled anti-rabbit and anti-goat antibodies as detection reagents. Immunization is 
generally performed by mixing or emulsifying the protein in saline, preferably in an adjuvant 
such as Freund's complete adjuvant, and injecting the mixture or emulsion parenterally 
(generally subcutaneously or intramuscularly). A dose of 50-200 jig/injection is typically 
sufficient. Immunization is generally boosted 2-6 weeks later with one or more injections of the 
protein in saline, preferably using Freund's incomplete adjuvant. One may alternatively 
generate antibodies by in vitro immunization using methods known in the art, which for the 
purposes of this invention is considered equivalent to in vivo immunization. 

Polyclonal antisera is obtained by bleeding the immunized animal into a glass or plastic 
container, incubating the blood at 25 °C for one hour, followed by incubating the blood at 4°C 
for 2-18 hours. The serum is recovered by centrifugation (e.g., l,000xg for 10 minutes). About 
20-50 ml per bleed may be obtained from rabbits. 

Monoclonal antibodies are prepared using the method of Kohler and Milstein, Nature 
256: 495 (1975), or modification thereof. Typically, a mouse or rat is immunized as described 
above. However, rather than bleeding the animal to extract serum, the spleen (and optionally 
several large lymph nodes) is removed and dissociated into single cells. If desired, the spleen 
cells can be screened (after removal of nonspecifically adherent cells) by applying a cell 
suspension to a plate, or well, coated with the protein antigen. B-cells producing membrane- 
bound immunoglobulin specific for the antigen bind to the plate, and are not rinsed away with 
the rest of the suspension. Resulting B-cells, or all dissociated spleen cells, are then induced to 
fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., 
hypoxanthine, aminopterin, thymidine medium, "HAT"). The resulting hybridomas are plated 
by limiting dilution, and are assayed for the production of antibodies which bind specifically to 
the immunizing antigen (and which do not bind to unrelated antigens). The selected Mab- 
secreting hybridomas are then cultured either in vitro (e.g., in tissue culture bottles or hollow 
fiber reactors), or in vivo (as ascites in mice). 

Other methods for sustaining antibody-producing B-cell clones, such as by EBV 
transformation, are known. 
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If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive atoms 
(particularly 32 P and l25 I), electron-dense reagents, enzymes, and ligands having specific binding 
partners. Enzymes are typically detected by their activity. For example, horseradish peroxidase 
is usually detected by its ability to convert 3,3',5,5'-tetramethylbenzidine (TNB) to a blue 
pigment, quantifiable with a spectrophotometer. 

A.2 In Vitro Applications of Polypeptides 

Some polypeptides of the invention will have enzymatic activities that are useful in vitro. 
For example, the soybean trypsin inhibitor (Kunitz) family is one of the numerous families of 
proteinase inhibitors. It comprises plant proteins which have inhibitory activity against serine 
proteinases from the trypsin and subtilisin families, thiol proteinases and aspartic proteinases. 
Thus, these peptides find in vitro use in protein purification protocols and perhaps in 
therapeutic settings requiring topical application of protease inhibitors. 

Delta-aminolevulinic acid dehydratase (EC 4.2.1.24 ) (ALAD) catalyzes the second 
step in the biosynthesis of heme, the condensation of two molecules of 5-aminolevulinate to 
form porphobilinogen and is also involved in chlorophyll biosynthesis(Kaczor et al. (1994) 
Plant Physiol. 1-4: 141 1-7; Smith (1988) Biochem. J. 249: 423-8; Schneider (1976) Z. 
naturforsch. [C] 31: 55-63). Thus, ALAD proteins can be used as catalysts in synthesis of 
heme derivatives. Enzymes of biosynthetic pathways generally can be used as catalysts for in 
vitro synthesis of the compounds representing products of the pathway. 

Polypeptides encoded by SDFs of the invention can be engineered to provide 
purification reagents to identify and purify additional polypeptides that bind to them. This 
allows one to identify proteins that function as multimers or elucidate signal transduction or 
metabolic pathways. In the case of DNA binding proteins, the polypeptide can be used in a 
similar manner to identify the DNA determinants of specific binding (S. Pierrou et al., Anal 
Biochem. 229:99 (1995), S. Chusacultanachai et al., J. Biol Chem. 274:23591 (1999), Q. Lin 
et al., / Biol Chem. ^272:27274 (1997)). 

n.B. POLYPEPTIDE VARIANTS , FRAGMENTS, AND FUSIONS 
Generally, variants , fragments, or fusions of the polypeptides encoded by the 
maximum length sequence(MLS) can exhibit at least one of the activities of the identified 
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domains and/or related polypeptides described in Sections (C) and (D) of The Reference 
tables corresponding to the MLS of interest. 



A type of variant of the native polypeptides comprises amino acid substitutions. 
Conservative substitutions, described above (see II.), are preferred to maintain the function or 
activity of the polypeptide. Such substitutions include conservation of charge, polarity, 
hydrophobicity, size, etc. For example, one or more amino acid residues within the sequence 
can be substituted with another amino acid of similar polarity that acts as a functional 
equivalent, for example providing a hydrogen bond in an enzymatic catalysis. Substitutes for an 
amino acid within an exemplified sequence are preferably made among the members of the class 
to which the amino acid belongs. For example, the nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The 
polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine and histidine. 
The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. 

Within the scope of percentage of sequence identity described above, a polypeptide of 
the invention may have additional individual amino acids or amino acid sequences inserted into 
the polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends thereof. 
Likewise, some of the amino acids or amino acid sequences may be deleted from the 
polypeptide. Amino acid substitutions may also be made in the sequences; conservative 
substitutions being preferred. 

One preferred class of variants are those that comprise (1) the domain of an encoded 
polypeptide and/or (2) residues conserved between the encoded polypeptide and related 
polypeptides. For this class of variants, the encoded polypeptide sequence is changed by 
insertion, deletion, or substitution at positions flanking the domain and/or conserved residues. 

Another class of variants includes those that comprise an encoded polypeptide 
sequence that is changed in the domain or conserved residues by a conservative substitution. 

Yet another class of variants includes those that lack one of the in vitro activities, or 
structural features of the encoded polypeptides. One example is polypeptides or proteins 
produced from genes comprising dominant negative mutations. Such a variant may comprise 
an encoded polypeptide sequence with non-conservative changes in a particular domain or 
group of conserved residues. 



ILB .(1) 



Variants 
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II.A.(2) FRAGMENTS 
Fragments of particular interest are those that comprise a domain identified for a 
polypeptide encoded by an MLS of the instant invention and variants thereof. Also, fragments 
that comprise at least one region of residues conserved between an MLS encoded polypeptide 
and its related polypeptides are of great interest. Fragments are sometimes useful as 
polypeptides corresponding to genes comprising dominant negative mutations are. 

n.A.(3) FUSIONS 
Of interest are chimeras comprising (1) a fragment of the MLS encoded polypeptide 
or variants thereof of interest and (2) a fragment of a polypeptide comprising the same 
domain. For example, an AP2 helix encoded by a MLS of the invention fused to second AP2 
helix from ANT protein, which comprises two AP2 helices. The present invention also 
encompasses fusions of MLS encoded polypeptides, variants, or fragments thereof fused with 
related proteins or fragments thereof. 

DEFINITION OF DOMAINS 

The polypeptides of the invention may possess identifying domains as shown in The 
Reference tables. Specific domains within the MLS encoded polypeptides are indicated in 
The Reference tables. In addition, the domains within the MLS encoded polypeptide can be 
defined by the region that exhibits at least 70% sequence identity with the consensus 
sequences listed in the detailed description below of each of the domains. 

The majority of the protein domain descriptions given in the protein domain table are 
obtained from Prosite, (http//www.expasy.ch/prosite/), and Pfam, 
(http//pfam.wustl.edu^rowse.shtml). Examples of domain descriptions are listed in the 
Protein Domain table. 

A. Activities of Polypeptides Comprising Signal Peptides 
Polypeptides comprising signal peptides are a family of proteins that are typically 
targeted to (1) a particular organelle or intracellular compartment, (2) interact with a 
particular molecule or (3) for secretion outside of a host cell. Example of polypeptides 
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comprising signal peptides include, without limitation, secreted proteins, soluble proteins, 
receptors, proteins retained in the ER, etc. 

These proteins comprising signal peptides are useful to modulate ligand-receptor 
interactions, cell-to-cell communication, signal transduction, intracellular communication, 
and activities and/or chemical cascades that take part in an organism outside or within of any 
particular cell. 

One class of such proteins are soluble proteins which are transported out of the cell. 
These proteins can act as ligands that bind to receptor to trigger signal transduction or to 
permit communication between cells. 

Another class is receptor proteins which also comprise a retention domain that lodges 
the receptor protein in the membrane when the cell transports the receptor to the surface of 
the cell. Like the soluble ligands, receptors can also modulate signal transduction and 
communication between cells. 

In addition the signal peptide itself can serve as a ligand for some receptors. An 
example is the interaction of the ER targeting signal peptide with the signal recognition 
particle (SRP). Here, the SRP binds to the signal peptide, halting translation, and the 
resulting SRP complex then binds to docking proteins located on the surface of the ER, 
prompting transfer of the protein into the ER. 

A description of signal peptide residue composition is described below in Subsection 

IV.C.l. 

III. Methods of Modulating Polypeptide Production 

It is contemplated that polynucleotides of the invention can be incorporated into a 
host cell or in- vitro system to modulate polypeptide production. For instance, the SDFs 
prepared as described herein can be used to prepare expression cassettes useful in a number of 
techniques for suppressing or enhancing expression. 

An example are polynucleotides comprising sequences to be transcribed, such as 
coding sequences, of the present invention can be inserted into nucleic acid constructs to 




Page 680 of 772 



Attorney Diet. 

Client Dkt. 80090.005 

modulate polypeptide production. Typically, such sequences to be transcribed are 
heterologous to at least one element of the nucleic acid construct to generate a chimeric gene 
or construct. 

Another example of useful polynucleotides are nucleic acid molecules comprising 
regulatory sequences of the present invention. Chimeric genes or constructs can be generated 
when the regulatory sequences of the invention linked to heterologous sequences in a vector 
construct. Within the scope of invention are such chimeric gene and/or constructs. 

Also within the scope of the invention are nucleic acid molecules, whereof at least a part 
or fragment of these DNA molecules are presented in the Reference and Sequence tables or 
polynucleotide encoding polypeptides of the Protein Group or Protein Group Matrix tables of 
the present application, and wherein the coding sequence is under the control of its own 
promoter and/or its own regulatory elements. Such molecules are useful for transforming the 
genome of a host cell or an organism regenerated from said host cell for modulating polypeptide 
production. 

Additionally, a vector capable of producing the oligonucleotide can be inserted into the 
host cell to deliver the oligonucleotide. 

More detailed description of components to be included in vector constructs are 
described both above and below. 

Whether the chimeric vectors or native nucleic acids are utilized, such 
polynucleotides can be incorporated into a host cell to modulate polypeptide production. 
Native genes and/or nucleic acid molecules can be effective when exogenous to the host cell. 
Methods of modulating polypeptide expression includes, without limitation: 
Suppression methods, such as 
Antisense 
Ribozymes 
Co-suppression 

Insertion of Sequences into the Gene to be Modulated 
Regulatory Sequence Modulation. 

as well as Methods for Enhancing Production, such as 
Insertion of Exogenous Sequences; and 
Regulatory Sequence Modulation. 
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m.A. Suppression 

Expression cassettes of the invention can be used to suppress expression of 
endogenous genes which comprise the SDF sequence. Inhibiting expression can be useful, 
for instance, to tailor the ripening characteristics of a fruit (Oeller et al., Science 254:437 
(1991)) or to influence seed size (WO98/07842) or to provoke cell ablation (Mariani et al, 
Nature 357: 384-387 (1992). 

As described above, a number of methods can be used to inhibit gene expression in 
plants, such as antisense, ribozyme, introduction of exogenous genes into a host cell, 
insertion of a polynucleotide sequence into the coding sequence and/or the promoter of the 
endogenous gene of interest, and the like. 



An expression cassette as described above can be transformed into host cell or plant to 
produce an antisense strand of RNA. For plant cells, antisense RNA inhibits gene expression 
by preventing the accumulation of mRNA which encodes the enzyme of interest, see, e.g., 
Sheehy et al., Proa Nat Acad. Sci. USA, 85:8805 (1988), and Hiatt et al, U.S. Patent No. 
4,801,340. 

m.A.2. Ribozymes 

Similarly, ribozyme constructs can be transformed into a plant to cleave mRNA and 
down-regulate translation. 



Another method of suppression is by introducing an exogenous copy of the gene to be 
suppressed. Introduction of expression cassettes in which a nucleic acid is configured in the 
sense orientation with respect to the promoter has been shown to prevent the accumulation of 
mRNA. A detailed description of this method is described above. 



Yet another means of suppressing gene expression is to insert a polynucleotide into the 
gene of interest to disrupt transcription or translation of the gene. 



m. A. 1. Antisense 



III. A.3. Co-Suppression 



III.A.4. 



Insertion of Sequences into the Gene to be Modulated 
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Homologous recombination could be used to target a polynucleotide insert to a gene 
using the Cre-Lox system (A.C. Vergunst et al., Nucleic Acids Res. 26:2729 (1998), A.C. 
Vergunst et al., Plant Mol. Biol 38:393 (1998), H. Albert et al., Plant! 7:649 (1995)). 

In addition, random insertion, of polynucleotides into a host cell genome can also be 
used to disrupt the gene of interest. Azpiroz-Leehan et al., Trends in Genetics J_3:152 (1997). 
In this method, screening for clones from a library containing random insertions is preferred 
for identifying those that have polynucleotides inserted into the gene of interest. Such 
screening can be performed using probes and/or primers described above based on sequences 
from the Reference and Sequence tables or polynucleotides encoding polypeptides of the Protein 
Group or Protein Group Matrix tables, fragments thereof, and substantially similar sequence 
thereto. The screening can also be performed by selecting clones or any transgenic plants 
having a desired phenotype. 

in.A.5. Regulatory SequenceModulation 
The SDFs described in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables, and fragments thereof are 
examples of nucleotides of the invention that contain regulatory sequences that can be used to 
suppress or inactivate transcription and/or translation from a gene of interest as discussed in 



When suppression of production of the endogenous, native protein is desired it is 
often helpful to express a gene comprising a dominant negative mutation. Production of 
protein variants produced from genes comprising dominant negative mutations is a useful 
tool for research Genes comprising dominant negative mutations can produce a variant 
polypeptide which is capable of competing with the native polypeptide, but which does not 
produce the native result. Consequently, over expression of genes comprising these mutations 
can titrate out an undesired activity of the native protein. For example, The product from a 
gene comprising a dominant negative mutation of a receptor can be used to constitutively 
activate or suppress a signal transduction cascade, allowing examination of the phenotype 
and thus the trait(s) controlled by that receptor and pathway. Alternatively, the protein arising 
from the gene comprising a dominant-negative mutation can be an inactive enzyme still capable 



I.C.5. 



III. A.6. Genes Comprising Dominant-Negative Mutations 
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of binding to the same substrate as the native protein and therefore competes with such native 
protein. 

Products from genes comprising dominant-negative mutations can also act upon the 
native protein itself to prevent activity. For example, the native protein may be active only as a 
homo-multimer or as one subunit of a hetero-multimer. Incorporation of an inactive subunit into 
the multimer with native subunit(s) can inhibit activity. 

Thus, gene function can be modulated in host cells of interest by insertion into these 
cells vector constructs comprising a gene comprising a dominant-negative mutation. 



H[.B. Enhanced Expression 

Enhanced expression of a gene of interest in a host cell can be accomplished by either 
(1) insertion of an exogenous gene; or (2) promoter modulation. 

ni.B.l. Insertion of an Exogenous Gene 

Insertion of an expression construct encoding an exogenous gene can boost the number 
of gene copies expressed in a host cell. 

Such expression constructs can comprise genes that either encode the native protein that 
is of interest or that encode a variant that exhibits enhanced activity as compared to the native 
protein. Such genes encoding proteins of interest can be constructed from the sequences from 
the Reference and Sequence tables or polynucleotides encoding polypeptides of the Protein 
Group or Protein Group Matrix tables, fragments thereof, and substantially similar sequence 
thereto. 

Such an exogenous gene can include either a constitutive promoter permitting 
expression in any cell in a host organism or a promoter that directs transcription only in 
particular cells or times during a host cell life cycle or in response to environmental stimuli. 

IIIB.2. Regulatory Sequence Modulation 
The SDFs of the Reference and Sequence tables, and fragments thereof, contain 
regulatory sequences that can be used to enhance expression of a gene of interest. For example, 
some of these sequences contain useful enhancer elements. In some cases, duplication of 
enhancer elements or insertion of exogenous enhancer elements will increase expression of a 
desired gene from a particular promoter. As other examples, all 11 promoters require binding of 
a regulatory protein to be activated, while some promoters may need a protein that signals a 
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promoter binding protein to expose a polymerase binding site. In either case, over-production 
of such proteins can be used to enhance expression of a gene of interest by increasing the 
activation time of the promoter. 

Such regulatory proteins are encoded by some of the sequences in the Reference and 
Sequence tables or polynucleotides encoding polypeptides of the Protein Group or Protein 
Group Matrix tables, fragments thereof, and substantially similar sequences thereto. 

Coding sequences for these proteins can be constructed as described above. 

IV. Gene Constructs and Vector Construction 

To use isolated SDFs of the present invention or a combination of them or parts and/or 
mutants and/or fusions of said SDFs in the above techniques, recombinant DNA vectors which 
comprise said SDFs and are suitable for transformation of cells, such as plant cells, are usually 
prepared. The SDF construct can be made using standard recombinant DNA techniques 
(Sambrook et al. 1989) and can be introduced to the species of interest by Agrobacterium- 
mediated transformation or by other means of transformation {e.g., particle gun 
bombardment) as referenced below. 

The vector backbone can be any of those typical in the art such as plasmids, viruses, 
artificial chromosomes, BACs, YACs and PACs and vectors of the sort described by 

(a) BAC: Shizuya et al., Proc. Natl. Acad. Sci. USA 89: 8794-8797 (1992); 
Hamilton et al, Proc. Natl. Acad. Sci. USA 93: 9975-9979 (1996); 

(b) YAC: Burke et al, Science 236:806-812 (1987);. 

(c) PAC: Sternberg N. et al., Proc Natl Acad Sci USA. Jan;87(l): 103-7 (1990); 

(d) Bacteria-Yeast Shuttle Vectors: Bradshaw et al., Nucl Acids Res 23: 4850- 
4856 (1995); 

(e) Lambda Phage Vectors: Replacement Vector, e.g., 
Frischauf et al., J. Mol Biol 170: 827-842 (1983); or Insertion vector, e.g., 

Huynh et al., In: Glover NM (ed) DNA Cloning: A practical Approach, Voll Oxford: IRL 
Press (1985); 

(f) T-DNA gene fusion vectors rWalden et al., Mol Cell Biol 1: 175-194 (1990); 
and 

(g) Plasmid vectors: Sambrook et al., infra. 

Typically, a vector will comprise the exogenous gene, which in its turn comprises an 
SDF of the present invention to be introduced into the genome of a host cell, and which gene 
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may be an antisense construct, a ribozyme construct chimeraplast, or a coding sequence with 
any desired transcriptional and/or translational regulatory sequences, such as promoters, UTRs, 
and 3' end termination sequences. Vectors of the invention can also include origins of 
replication, scaffold attachment regions (SARs), markers, homologous sequences, introns, etc. 

A DNA sequence coding for the desired polypeptide, for example a cDNA sequence 
encoding a full length protein, will preferably be combined with transcriptional and translational 
initiation regulatory sequences which will direct the transcription of the sequence from the gene 
in the intended tissues of the transformed plant. 

For example, for over-expression, a plant promoter fragment may be employed that will 
direct transcription of the gene in all tissues of a regenerated plant. Alternatively, the plant 
promoter may direct transcription of an SDF of the invention in a specific tissue (tissue-specific 
promoters) or may be otherwise under more precise environmental control (inducible 
promoters). 

If proper polypeptide productionis desired, a polyadenylation region at the 3'-end of the 
coding region is typically included. The polyadenylation region can be derived from the natural 
gene, from a variety of other plant genes, or from T-DNA. 

The vector comprising the sequences from genes or SDF or the invention may 
comprise a marker gene that confers a selectable phenotype on plant cells. The vector can 
include promoter and coding sequence, for instance. For example, the marker may encode 
biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, 
bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosulfuron or 
phosphinotricin. 

IV.A. Coding Sequences 

Generally, the sequence in the transformation vector and to be introduced into the 
genome of the host cell does not need to be absolutely identical to an SDF of the present 
invention. Also, it is not necessary for it to be full length, relative to either the primary 
transcription product or fully processed mRNA. Furthermore, the introduced sequence need 
not have the same intron or exon pattern as a native gene. Also, heterologous non-coding 
segments can be incorporated into the coding sequence without changing the desired amino 
acid sequence of the polypeptide to be produced. 

IV.B. Promoters 
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As explained above, introducing an exogenous SDF from the same species or an 
orthologous SDF from another species are useful to modulate the expression of a native gene 
corresponding to that SDF of interest. Such an SDF construct can be under the control of 
either a constitutive promoter or a highly regulated inducible promoter (e.g., a copper 
inducible promoter). The promoter of interest can initially be either endogenous or 
heterologous to the species in question. When re-introduced into the genome of said species, 
such promoter becomes exogenous to said species. Over-expression of an SDF transgene can 
lead to co-suppression of the homologous endogeneous sequence thereby creating some 
alterations in the phenotypes of the transformed species as demonstrated by similar analysis 
of the chalcone synthase gene (Napoli et al., Plant Cell 2:279 (1990) and van der Krol et al., 
Plant Cell 2:291 (1990)). If an SDF is found to encode a protein with desirable 
characteristics, its over-production can be controlled so that its accumulation can be 
manipulated in an organ- or tissue-specific manner utilizing a promoter having such 
specificity. 

Likewise, if the promoter of an SDF (or an SDF that includes a promoter) is found to 
be tissue-specific or developmentally regulated, such a promoter can be utilized to drive or 
facilitate the transcription of a specific gene of interest (e.g., seed storage protein or root- 
specific protein). Thus, the level of accumulation of a particular protein can be manipulated 
or its spatial localization in an organ- or tissue- specific manner can be altered. 

IV. C Signal Peptides 

SDFs of the present invention containing signal peptides are indicated in the 
Reference, Sequence, the Protein Group and Protein Group Matrix tables. In some cases it may 
be desirable for the protein encoded by an introduced exogenous or orthologous SDF to be 
targeted (1) to a particular organelle intracellular compartment, (2) to interact with a 
particular molecule such as a membrane molecule or (3) for secretion outside of the cell 
harboring the introduced SDF. This will be accomplished using a signal peptide. 

Signal peptides direct protein targeting, are involved in ligand-receptor interactions 
and act in cell to cell communication. Many proteins, especially soluble proteins, contain a 
signal peptide that targets the protein to one of several different intracellular compartments. 
In plants, these compartments include, but are not limited to, the endoplasmic reticulum (ER), 
mitochondria, plastids (such as chloroplasts), the vacuole, the Golgi apparatus, protein 
storage vessicles (PSV) and, in general, membranes. Some signal peptide sequences are 
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conserved, such as the Asn-Pro-Ile-Arg amino acid motif found in the N-terminal propeptide 
signal that targets proteins to the vacuole (Marty (1999) The Plant Cell 1 1 : 587-599). Other 
signal peptides do not have a consensus sequence per se 9 but are largely composed of 
hydrophobic amino acids, such as those signal peptides targeting proteins to the ER (Vitale 
and Denecke (1999) The Plant Cell 1 1 : 615-628). Still others do not appear to contain either 
a consensus sequence or an identified common secondary sequence, for instance the 
chloroplast stromal targeting signal peptides (Keegstra and Cline (1999) The Plant Cell 11: 
557-570). Furthermore, some targeting peptides are bipartite, directing proteins first to an 
organelle and then to a membrane within the organelle (e.g. within the thylakoid lumen of the 
chloroplast; see Keegstra and Cline (1999) The Plant Cell 1 1 : 557-570). In addition to the 
diversity in sequence and secondary structure, placement of the signal peptide is also varied. 
Proteins destined for the vacuole, for example, have targeting signal peptides found at the N- 
terminus, at the C-terminus and at a surface location in mature, folded proteins. Signal 
peptides also serve as ligands for some receptors. 

These characteristics of signal proteins can be used to more tightly control the 
phenotypic expression of introduced SDFs. In particular, associating the appropriate signal 
sequence with a specific SDF can allow sequestering of the protein in specific organelles 
(plastids, as an example), secretion outside of the cell, targeting interaction with particular 
receptors, etc. Hence, the inclusion of signal proteins in constructs involving the SDFs of the 
invention increases the range of manipulation of SDF phenotypic expression. The nucleotide 
sequence of the signal peptide can be isolated from characterized genes using common 
molecular biological techniques or can be synthesized in vitro. 

In addition, the native signal peptide sequences, both amino acid and nucleotide, 
described in the Reference, Sequence, Protein Group or Protein Group Matrix tables can be 
used to modulate polypeptide transport. Further variants of the native signal peptides 
described in the Reference, Sequence, Protein Group or Protein Group Matrix tables are 
contemplated. Insertions, deletions, or substitutions can be made. Such variants will retain at 
least one of the functions of the native signal peptide as well as exhibiting some degree of 
sequence identity to the native sequence. 

Also, fragments of the signal peptides of the invention are useful and can be fused 
with other signal peptides of interest to modulate transport of a polypeptide. 

V. Transformation Techniques 
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A wide range of techniques for inserting exogenous polynucleotides are known for a 
number of host cells, including, without limitation, bacterial, yeast, mammalian, insect and plant 
cells. 

Techniques for transforming a wide variety of higher plant species are well known and 
described in the technical and scientific literature. See, e.g. Weising et al., Ann. Rev. Genet. 
22:421 (1988); and Christou, Euphytica, v. 85, n.l-3:13-27, (1995). 

DNA constructs of the invention may be introduced into the genome of the desired plant 
host by a variety of conventional techniques. For example, the DNA construct may be 
introduced directly into the genomic DNA of the plant cell using techniques such as 
electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be 
introduced directly to plant tissue using ballistic methods, such as DNA particle bombardment. 
Alternatively, the DNA constructs may be combined with suitable T-DNA flanking regions and 
introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions 
of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent 
marker into the plant cell DNA when the cell is infected by the bacteria (McCormac et al., Mol. 
Biotechnol. 8:199 (1997); Hamilton, Gene 200:107 (1997)); Salomon et al. EMBOJ. 3:141 
(1984); Herrera-Estrella et al. EMBOJ. 2:987 (1983). 

Microinjection techniques are known in the art and well described in the scientific and 
patent literature. The introduction of DNA constructs using polyethylene glycol precipitation is 
described in Paszkowski et al. EMBOJ. 3:2717 (1984). Electroporation techniques are 
described in Fromm et al. Proc. Natl Acad. Sci. USA 82:5824 (1985). Ballistic transformation 
techniques are described in Klein et al. Nature 327:773 (1987). Agrobacterium 
tumefaciens-mediated transformation techniques, including disarming and use of binary or co- 
integrate vectors, are well described in the scientific literature. See, for example Hamilton, CM, 
Gene 200:107 (1997); Miiller et al. Mol. Gen. Genet. 207:171 (1987); Komari et al. Plant J. 
10:165 (1996); Venkateswarlu et al. Biotechnology 9:1103 (1991) and Gleave, AP., Plant Mol. 
Biol. 20:1203 (1992); Graves and Goldman, Plant Mol. Biol. 7:34 (1986) and Gould et al, Plant 
Physiology 95:426 (1991). 

Transformed plant cells which are derived by any of the above transformation 
techniques can be cultured to regenerate a whole plant that possesses the transformed genotype 
and thus the desired phenotype such as seedlessness. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a 
biocide and/or herbicide marker which has been introduced together with the desired nucleotide 
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sequences. Plant regeneration from cultured protoplasts is described in Evans et al., Protoplasts 
Isolation and Culture in "Handbook of Plant Cell Culture " pp. 124-176, MacMillan Publishing 
Company, New York, 1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, 
CRC Press, Boca Raton, 1988. Regeneration can also be obtained from plant callus, explants, 
organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. Ann. 
Rev. of Plant Phys. 38:467 (1987). Regeneration of monocots (rice) is described by Hosoyama 
et al. (Biosci. Biotechnol Biochem. 58:1500 (1994)) and by Ghosh et al. (J. Biotechnol 32:1 
(1994)). The nucleic acids of the invention can be used to confer desired traits on essentially any 
plant. 

Thus, the invention has use over a broad range of plants, including species from the 
genera Anacardium, Arachis, Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, Capsicum, 
Carthamus, Cocos, Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria, Glycine, Gossypium, 
Helianthus, Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, Lolium,Lupinus, 
Lycopersicon, Malus, Manihot, Majorana, Medicago, Nicotiana, Olea, Oryza, Panieum, 
Pannesetum, Persea, Phaseolus, Pistachia, Pisum, Pyrus, Prunus, Raphanus, Ricinus, Secale, 
Senecio, Sinapis, Solanum, Sorghum, Theobromus, Trigonella, Triticum, Vicia, Vitis, Vigna, 
and, Zea. 

One of skill will recognize that after the expression cassette is stably incorporated in 
transgenic plants and confirmed to be operable, it can be introduced into other plants by 
sexual crossing. Any of a number of standard breeding techniques can be used, depending 
upon the species to be crossed. 

The particular sequences of SDFs identified are provided in the attached Reference 
and Sequence tables. 
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DEFINITIONS 



Allelic variant: An "allelic variant" is an alternative form of the same SDF, which 

resides at the same chromosomal locus in the organism. Allelic variations can occur in any 
5 portion of the gene sequence, including regulatory regions. Allelic variants can arise by 
normal genetic variation in a population. Allelic variants can also be produced by genetic 
engineering methods. An allelic variant can be one that is found in a naturally occurring 
plant, including a cultivar or ecotype. An allelic variant may or may not give rise to a 
phenotypic change, and may or may not be expressed. An allele can result in a detectable 
10 change in the phenotype of the trait represented by the locus. A phenotypically silent allele 
can give rise to a product. 

Alternatively spliced messages: Within the context of the current invention, 

D "alternatively spliced messages" refers to mature mRNAs originating from a single gene with 

* ... 
JH> variations in the number and/or identity of exons, introns and/or intron-exon junctions. 

PJ Chimeric: The term "chimeric" is used to describe genes, as defined supra, or contructs 

i p- 

Li : 

fp wherein at least two of the elements of the gene or construct, such as the promoter and the 

^ coding sequence and/or other regulatory sequences and/or filler sequences and/or complements 

B *; 

QiO thereof, are heterologous to each other. 

ns; : 

iy 

S Constitutive Promoter: Promoters referred to herein as "constitutive promoters" actively 

promote transcription under most, but not necessarily all, environmental conditions and states of 
development or cell differentiation. Examples of constitutive promoters include the cauliflower 

25 mosaic virus (CaMV) 35S transcript initiation region and the V or T promoter derived from 
T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions from various 
plant genes, such as the maize ubiquitin-1 promoter, known to those of skill 

Coordinately Expressed: The term "coordinately expressed," as used in the current 
30 invention, refers to genes that are expressed at the same or a similar time and/or stage and/or 
under the same or similar environmental conditions. 
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Domain: Domains are fingerprints or signatures that can be used to characterize protein 
families and/or parts of proteins. Such fingerprints or signatures can comprise conserved (1) 
primary sequence, (2) secondary structure, and/or (3) three-dimensional conformation. 
Generally, each domain has been associated with either a family of proteins or motifs. 
5 Typically, these families and/or motifs have been correlated with specific in-vitro and/or in- 
vivo activities. A domain can be any length, including the entirety of the sequence of a 
protein. Detailed descriptions of the domains, associated families and motifs, and correlated 
activities of the polypeptides of the instant invention are described below. Usually, the 
polypeptides with designated domain(s) can exhibit at least one activity that is exhibited by 
10 any polypeptide that comprises the same domain(s). 

Endogenous: The term "endogenous," within the context of the current invention refers to 
any polynucleotide, polypeptide or protein sequence which is a natural part of a cell or 
C organisms regenerated from said cell. 

is 

■aj- 
ar: 

% Exogenous: "Exogenous," as referred to within, is any polynucleotide, polypeptide or 

nJ protein sequence, whether chimeric or not, that is initially or subsequently introduced into 

m the genome of an individual host cell or the organism regenerated from said host cell by any 
means other than by a sexual cross. Examples of means by which this can be accomplished 

s ssr 

HO are described below, and include Agrobacterium-msdi&ted transformation (of dicots - e.g. 

% Salomon et al. EMBO J. 3:141 (1984); Herrera-Estrella et al. EMBO J. 2:987 (1983); of 

P monocots, representative papers are those by Escudero et al., Plant J. 10:355 (1996), Ishida et 

ST""' 

al., Nature Biotechnology 14:745 (1996), May et al., Bio/Technology 13:486 (1995)), biolistic 
methods (Armaleo et al., Current Genetics 17:97 1990)), electroporation, in planta 
25 techniques, and the like. Such a plant containing the exogenous nucleic acid is referred to 
here as a T 0 for the primary transgenic plant and Ti for the first generation. The term 
"exogenous" as used herein is also intended to encompass inserting a naturally found element 
into a non-naturally found location. 

30 Filler sequence: As used herein, "filler sequence" refers to any nucleotide sequence that 
is inserted into DNA construct to evoke a particular spacing between particular components 
such as a promoter and a coding region and may provide an additional attribute such as a 
restriction enzyme site. 
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Gene: The term "gene," as used in the context of the current invention, encompasses 

all regulatory and coding sequence contiguously associated with a single hereditary unit with 
a genetic function (see SCHEMATIC 1). Genes can include non-coding sequences that 
5 modulate the genetic function that include, but are not limited to, those that specify 

polyadenylation, transcriptional regulation, DNA conformation, chromatin conformation, 
extent and position of base methylation and binding sites of proteins that control all of these. 
Genes comprised of "exons" (coding sequences), which may be interrupted by "introns" 
(non-coding sequences), encode proteins. A gene's genetic function may require only RNA 

10 expression or protein production, or may only require binding of proteins and/or nucleic acids 
without associated expression. In certain cases, genes adjacent to one another may share 
sequence in such a way that one gene will overlap the other. A gene can be found within the 
genome of an organism, artificial chromosome, plasmid, vector, etc., or as a separate isolated 

C entity. 

ft 

J; Gene Family: "Gene family" is used in the current invention to describe a group of 

3 : 

fU functionally related genes, each of which encodes a separate protein. 

m 

* Heterologous sequences: "Heterologous sequences" are those that are not operatively 
0!0 linked or are not contiguous to each other in nature. For example, a promoter from com is 

S p considered heterologous to an Arabidopsis coding region sequence. Also, a promoter from a 

+ 

O gene encoding a growth factor from com is considered heterologous to a sequence encoding the 
corn receptor for the growth factor. Regulatory element sequences, such as UTRs or 3' end 
termination sequences that do not originate in nature from the same gene as the coding sequence 

25 originates from, are considered heterologous to said coding sequence. Elements operatively 

linked in nature and contiguous to each other are not heterologous to each other. On the other 
hand, these same elements remain operatively linked but become heterologous if other filler 
sequence is placed between them. Thus, the promoter and coding sequences of a corn gene 
expressing an amino acid transporter are not heterologous to each other, but the promoter and 

30 coding sequence of a com gene operatively linked in a novel manner are heterologous. 

Homologous gene: In the current invention, "homologous gene" refers to a gene that shares 
sequence similarity with the gene of interest. This similarity may be in only a fragment of the 
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sequence and often represents a functional domain such as, examples including without 
limitation a DNA binding domain, a domain with tyrosine kinase activity, or the like. The 
functional activities of homologous genes are not necessarily the same. 

5 Inducible Promoter: An "inducible promoter" in the context of the current invention refers to 
a promoter which is regulated under certain conditions, such as light, chemical concentration, 
protein concentration, conditions in an organism, cell, or organelle, etc. A typical example of an 
inducible promoter, which can be utilized with the polynucleotides of the present invention, is 
PARSK1, the promoter from the Arabidopsis gene encoding a serine-threonine kinase enzyme, 
10 and which promoter is induced by dehydration, abscissic acid and sodium chloride (Wang and 
Goodman, Plant J. 8:37 (1995)) Examples of environmental conditions that may affect 
transcription by inducible promoters include anaerobic conditions, elevated temperature, or the 
presence of light. 

. 5 s : 

3jj> Intergenic region: "Intergenic region," as used in the current invention, refers to 
% nucleotide sequence occurring in the genome that separates adjacent genes. 

m 

£ fl 

J Mutant gene: In the current invention, "mutant" refers to a heritable change in DNA 

1^ sequence at a specific location. Mutants of the current invention may or may not have an 
SO associated identifiable function when the mutant gene is transcribed. 

r- 

S Orthologous Gene: In the current invention "orthologous gene" refers to a second gene that 
encodes a gene product that performs a similar function as the product of a first gene. The 
orthologous gene may also have a degree of sequence similarity to the first gene. The 

25 orthologous gene may encode a polypeptide that exhibits a degree of sequence similarity to a 
polypeptide corresponding to a first gene. The sequence similarity can be found within a 
functional domain or along the entire length of the coding sequence of the genes and/or their 
corresponding polypeptides. 

30 Percentage of sequence identity: "Percentage of sequence identity," as used herein, is 

determined by comparing two optimally aligned sequences over a comparison window, where 
the fragment of the polynucleotide or amino acid sequence in the comparison window may 
comprise additions or deletions (e.g., gaps or overhangs) as compared to the reference sequence 
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(which does not comprise additions or deletions) for optimal alignment of the two sequences. 
The percentage is calculated by determining the number of positions at which the identical 
nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 
window of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. Optimal alignment of sequences for comparison may be conducted by the local 
homology algorithm of Smith and Waterman Add, APL. Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch./ Mol. Biol. 48:443 (1970), by the search for 
similarity method of Pearson and Lipman Proc. Natl. Acad. Sci. (USA) 85: 2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, BLAST, PASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 
Science Dr., Madison, WI), or by inspection. Given that two sequences have been identified for 
comparison, GAP and BESTFIT are preferably employed to determine their optimal alignment. 
Typically, the default values of 5.00 for gap weight and 0.30 for gap weight length are used. 
The term "substantial sequence identity" between polynucleotide or polypeptide sequences 
refers to polynucleotide or polypeptide comprising a sequence that has at least 80% sequence 
identity, preferably at least 85%, more preferably at least 90% and most preferably at least 95%, 
even more preferably, at least 96%, 97%, 98% or 99% sequence identity compared to a 
reference sequence using the programs. 

Plant Promoter: A "plant promoter" is a promoter capable of initiating transcription in 
plant cells and can drive or facilitate transcription of a fragment of the SDF of the instant 
invention or a coding sequence of the SDF of the instant invention. Such promoters need not 
be of plant origin. For example, promoters derived from plant viruses, such as the CaMV35S 
promoter or from Agrobacterium tumefaciens such as the T-DNA promoters, can be plant 
promoters. A typical example of a plant promoter of plant origin is the maize ubiquitin-1 (ubi- 
l)promoter known to those of skill. 

Promoter: The term "promoter," as used herein, refers to a region of sequence determinants 
located upstream from the start of transcription of a gene and which are involved in recognition 
and binding of RNA polymerase and other proteins to initiate and modulate transcription. A 
basal promoter is the minimal sequence necessary for assembly of a transcription complex 
required for transcription initiation. Basal promoters frequently include a "TATA box" element 
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usually located between 15 and 35 nucleotides upstream from the site of initiation of 
transcription. Basal promoters also sometimes include a "CCAAT box" element (typically a 
sequence CCAAT) and/or a GGGCG sequence, usually located between 40 and 200 nucleotides, 
preferably 60 to 120 nucleotides, upstream from the start site of transcription. 



application, refers to any sequence that has been deposited in a publicly accessible database. 
This term encompasses both amino acid and nucleotide sequences. Such sequences are 
publicly accessible, for example, on the BLAST databases on the NCBI FTP web site 
(accessible at ncbi.nlm.gov/blast). The database at the NCBI GTP site utilizes "gi" numbers 
assigned by NCBI as a unique identifier for each sequence in the databases, thereby 
providing a non-redundant database for sequence from various databases, including 
GenBank, EMBL, DBBJ, (DNA Database of Japan) and PDB (Brookhaven Protein Data 
Bank). 

Regulatory Sequence: The term "regulatory sequence," as used in the current 

invention, refers to any nucleotide sequence that influences transcription or translation 
initiation and rate, and stability and/or mobility of the transcript or polypeptide product. 
Regulatory sequences include, but are not limited to, promoters, promoter control elements, 
protein binding sequences, 5' and 3' UTRs, transcriptional start site, termination sequence, 
polyadenylation sequence, introns, certain sequences within a coding sequence, etc. 

Related Sequences: "Related sequences" refer to either a polypeptide or a nucleotide 
sequence that exhibits some degree of sequence similarity with a sequence described by The 
Reference tables and The Sequence tables. 

Scaffold Attachment Region (SAR): As used herein, "scaffold attachment region" is a 

DNA sequence that anchors chromatin to the nuclear matrix or scaffold to generate loop 
domains that can have either a transcriptionally active or inactive structure (Spiker and 
Thompson (1996) Plant Physiol. 110: 15-21). 

Sequence-determined DNA fragments (SDFs): "Sequence-determined DNA fragments" 
as used in the current invention are isolated sequences of genes, fragments of genes, 



Public sequence: 



The term "public sequence ," as used in the context of the instant 
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intergenic regions or contiguous DNA from plant genomic DNA or cDNA or RNA the 
sequence of which has been determined. 



sequence that targets the protein for secretion, for transport to an intracellular compartment or 
organelle or for incorporation into a membrane. Signal peptides are indicated in the tables 
and a more detailed description located below. 

Specific Promoter: In the context of the current invention, "specific promoters" refers to a 
subset of inducible promoters that have a high preference for being induced in a specific 
tissue or cell and/or at a specific time during development of an organism. By "high 
preference" is meant at least 3-fold, preferably 5-fold, more preferably at least 10-fold still 
more preferably at least 20-fold, 50-fold or 100-fold increase in transcription in the desired 
tissue over the transcription in any other tissue. Typical examples of temporal and/or tissue 
specific promoters of plant origin that can be used with the polynucleotides of the present 
invention, are: PTA29, a promoter which is capable of driving gene transcription specifically in 
tapetum and only during anther development (Koltonow et al, Plant Cell 2:1201 (1990); RCc2 
and RCc3, promoters that direct root-specific gene transcription in rice (Xu et al, Plant Mol 
Biol 27:237 (1995); TobRB27, a root-specific promoter from tobacco (Y amamoto et al., Plant 
Cell 3:371 (1991)). Examples of tissue-specific promoters under developmental control include 
promoters that initiate transcription only in certain tissues or organs, such as root, ovule, fruit, 
seeds, or flowers. Other suitable promoters include those from genes encoding storage proteins 
or the lipid body membrane protein, oleosin. A few root-specific promoters are noted above. 

Stringency: "Stringency" as used herein is a function of probe length, probe composition 
(G + C content), and salt concentration, organic solvent concentration, and temperature of 
hybridization or wash conditions. Stringency is typically compared by the parameter T m , 
which is the temperature at which 50% of the complementary molecules in the hybridization 
are hybridized, in terms of a temperature differential from T m . High stringency conditions are 
those providing a condition of T m - 5°C to T m - 10°C. Medium or moderate stringency 
conditions are those providing T m - 20°C to T m - 29°C. Low stringency conditions are those 
providing a condition of T m - 40°C to T m - 48°C. The relationship of hybridization conditions 
to T m (in °C) is expressed in the mathematical equation 



Signal Peptide: 



A "signal peptide" as used in the current invention is an amino acid 
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T m = 81.5 -16.6(log 10 [Na + ]) + 0.41(%G+C) - (600/N) (1) 

where N is the length of the probe. This equation works well for probes 14 to 70 nucleotides in 
5 length that are identical to the target sequence. The equation below for T m of DNA-DNA 

hybrids is useful for probes in the range of 50 to greater than 500 nucleotides, and for conditions 
that include an organic solvent (formamide). 

T m = 81.5+16.6 log {[Na + ]/(l+0.7[Na + ])}+ 0.41(%G+C)-500/L 0.63(%formamide) 
10 (2) 

where L is the length of the probe in the hybrid. (P. Tijessen, "Hybridization with Nucleic 
Acid Probes" in Laboratory Techniques in Biochemistry and Molecular Biology, P.C. vand 
O der Vliet, ed., c. 1993 by Elsevier, Amsterdam.) The T m of equation (2) is affected by the 
5 nature of the hybrid; for DNA-RNA hybrids T m is 10-1 5°C higher than calculated, for RNA- 
X RNA hybrids T m is 20-25°C higher. Because the T m decreases about 1 °C for each 1% 
TU decrease in homology when a long probe is used (Bonner et al., 1 Mol Biol 81:123 (1973)), 
gi stringency conditions can be adjusted to favor detection of identical genes or related family 
L, members. 

W) Equation (2) is derived assuming equilibrium and therefore, hybridizations according 

sk: : 
I- I 

j£ to the present invention are most preferably performed under conditions of probe excess and 

^ for sufficient time to achieve equilibrium. The time required to reach equilibrium can be 

shortened by inclusion of a hybridization accelerator such as dextran sulfate or another high 
volume polymer in the hybridization buffer. 

25 Stringency can be controlled during the hybridization reaction or after hybridization 

has occurred by altering the salt and temperature conditions of the wash solutions used. The 
formulas shown above are equally valid when used to compute the stringency of a wash 
solution. Preferred wash solution stringencies lie within the ranges stated above; high 
stringency is 5-8°C below T m> medium or moderate stringency is 26-29°C below T m and low 

30 stringency is 45-48°C below T m . 

Substantially free of: A composition containing A is "substantially free of" B when 

at least 85% by weight of the total A+B in the composition is A. Preferably, A comprises at 
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least about 90% by weight of the total of A+B in the composition, more preferably at least 
about 95% or even 99% by weight. For example, a plant gene or DNA sequence can be 
considered substantially free of other plant genes or DNA sequences. 

Translational start site: In the context of the current invention, a "translational start 
site" is usually an ATG in the cDNA transcript, more usually the first ATG. A single cDNA, 
however, may have multiple translational start sites. 

Transcription start site: "Transcription start site" is used in the current invention to 
describe the point at which transcription is initiated. This point is typically located about 25 
nucleotides downstream from a TFIH) binding site, such as a TATA box. Transcription can 
initiate at one or more sites within the gene, and a single gene may have multiple transcriptional 
start sites, some of which may be specific for transcription in a particular cell-type or tissue. 

Untranslated region (UTR): A "UTR" is any contiguous series of nucleotide bases 

that is transcribed, but is not translated. These untranslated regions may be associated with 
particular functions such as increasing mRNA message stability. Examples of UTRs include, 
but are not limited to polyadenylation signals, terminations sequences, sequences located 
between the transcriptional start site and the first exon (5 5 UTR) and sequences located 
between the last exon and the end of the mRNA (3' UTR). 

Variant: The term "variant" is used herein to denote a polypeptide or protein or 
polynucleotide molecule that differs from others of its kind in some way. For example, 
polypeptide and protein variants can consist of changes in amino acid sequence and/or charge 
and/or post-translational modifications (such as glycosylation, etc). 
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EXAMPLES 

The invention is illustrated by way of the following examples. The invention is not 
limited by these examples as the scope of the invention is defined solely by the claims 
following. 

EXAMPLE 1: cDNA PREPARATION 

A number of the nucleotide sequences disclosed in the Reference and Sequence tables or 
polynucleotides encoding polypeptides of the Protein Group or Protein Group Matrix tables, 
herein as representative of the SDFs of the invention can be obtained by sequencing genomic 
DNA (gDNA) and/or cDNA from corn plants grown from HYBRID SEED # 35A19, purchased 
from Pioneer Hi-Bred International, Inc., Supply Management, P.O. Box 256, Johnston, Iowa 
50131-0256. 

A number of the nucleotide sequences disclosed in the Reference and Sequence tables 
or polynucleotides encoding polypeptides of the Protein Group or Protein Group Matrix tables, 
herein as representative of the SDFs of the invention can also be obtained by sequencing 
genomic DNA from Arabidopsis thaliana, Wassilewskija ecotype or by sequencing cDNA 
obtained from mRNA from such plants as described below. This is a true breeding strain. 
Seeds of the plant are available from the Arabidopsis Biological Resource Center at the Ohio 
State University, under the accession number CS2360. Seeds of this plant were deposited 
under the terms and conditions of the Budapest Treaty at the American Type Culture 
Collection, Manassas, VA on August 31, 1999, and were assigned ATCC No. PTA-595. 

Other methods for cloning full-length cDNA are described, for example, by Seki et 
al., Plant Journal 15:707-720 (1998) "High-efficiency cloning of Arabidopsis full-length 
cDNA by biotinylated Cap trapper"; Maruyama et al., Gene 138:171 (1994) "Oligo-capping a 
simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides"; 
and WO 96/34981. 

Tissues were, or each organ was, individually pulverized and frozen in liquid 
nitrogen. Next, the samples were homogenized in the presence of detergents and then 
centrifuged. The debris and nuclei were removed from the sample and more detergents were 
added to the sample. The sample was centrifuged and the debris was removed. Then the 
sample was applied to a 2M sucrose cushion to isolate polysomes. The RNA was isolated by 
treatment with detergents and proteinase K followed by ethanol precipitation and 
centrifugation. The polysomal RNA from the different tissues was pooled according to the 
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following mass ratios: 15/15/1 for male inflorescences, female inflorescences and root, 
respectively. The pooled material was then used for cDNA synthesis by the methods 
described below. 

Starting material for cDNA synthesis for the exemplary corn cDNA clones with 
sequences presented in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables was poly(A)-containing 
polysomal mRNAs from inflorescences and root tissues of corn plants grown from HYBRID 
SEED # 35A19. Male inflorescences and female (pre-and post- fertilization) inflorescences 
were isolated at various stages of development. Selection for poly(A) containing polysomal 
RNA was done using oligo d(T) cellulose columns, as described by Cox and Goldberg, "Plant 
Molecular Biology: A Practical Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The 
quality and the integrity of the polyA+ RNAs were evaluated. 

Starting material for cDNA synthesis for the exemplary Arabidopsis cDNA clones 
with sequences presented in the Reference and Sequence tables or polynucleotides encoding 
polypeptides of the Protein Group or Protein Group Matrix tables was polysomal RNA isolated 
from the top-most inflorescence tissues of Arabidopsis thaliana Wassilewskija (Ws.) and 
from roots of Arabidopsis thaliana Landsberg erecta (L. er.), also obtained from the 
Arabidopsis Biological Resource Center. Nine parts inflorescence to every part root was 
used, as measured by wet mass. Tissue was pulverized and exposed to liquid nitrogen. Next, 
the sample was homogenized in the presence of detergents and then centrifuged. The debris 
and nuclei were removed from the sample and more detergents were added to the sample. 
The sample was centrifuged and the debris was removed and the sample was applied to a 2M 
sucrose cushion to isolate polysomal RNA. Cox et al., "Plant Molecular Biology: A Practical 
Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The polysomal RNA was used 
for cDNA synthesis by the methods described below. Polysomal mRNA was then isolated as 
described above for corn cDNA. The quality of the RNA was assessed electrophoretically. 

Following preparation of the mRNAs from various tissues as described above, selection 
of mRNA with intact 5' ends and specific attachment of an oligonucleotide tag to the 5 5 end of 
such mRNA was performed using either a chemical or enzymatic approach. Both techniques 
take advantage of the presence of the "cap" structure, which characterizes the 5' end of most 
intact mRNAs and which comprises a guanosine generally methylated once, at the 7 position. 

The chemical modification approach involves the optional elimination of the 2', 3'-cis 
diol of the 3' terminal ribose, the oxidation of the 2', 3'-cis diol of the ribose linked to the cap of 
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the 5' ends of the mRNAs into a dialdehyde, and the coupling of the such obtained dialdehyde to 
a derivatized oligonucleotide tag. Further detail regarding the chemical approaches for 
obtaining mRNAs having intact 5' ends are disclosed in International Application No. 
W096/34981 published November 7, 1996. 

The enzymatic approach for ligating the oligonucleotide tag to the intact 5' ends of 
mRNAs involves the removal of the phosphate groups present on the 5' ends of uncapped 
incomplete mRNAs, the subsequent decapping of mRNAs having intact 5' ends and the ligation 
of the phosphate present at the 5' end of the decapped mRNA to an oligonucleotide tag. Further 
detail regarding the enzymatic approaches for obtaining mRNAs having intact 5' ends are 
disclosed in Dumas Milne Edwards J.B. (Doctoral Thesis of Paris VI University, Le clonage des 
ADNc complets: difficultes et perspectives nouvelles. Apports pour l'etude de la regulation de 
l'expression de la tryptophane hydroxylase de rat, 20 Dec. 1993), EP0 625572 and Kato et al 9 
Gene 150:243-250 (1994). 

In both the chemical and the enzymatic approach, the oligonucleotide tag has a 
restriction enzyme site (e.g. an EcoRI site) therein to facilitate later cloning procedures. 
Following attachment of the oligonucleotide tag to the mRNA, the integrity of the mRNA is 
examined by performing a Northern blot using a probe complementary to the oligonucleotide 



For the mRNAs joined to oligonucleotide tags using either the chemical or the enzymatic 
method, first strand cDNA synthesis is performed using an oligo-dT primer with reverse 
transcriptase. This oligo-dT primer can contain an internal tag of at least 4 nucleotides, which 
can be different from one mRNA preparation to another. Methylated dCTP is used for cDNA 
first strand synthesis to protect the internal EcoRI sites from digestion during subsequent steps. 
The first strand cDNA is precipitated using isopropanol after removal of RNA by alkaline 
hydrolysis to eliminate residual primers. 

Second strand cDNA synthesis is conducted using a DNA polymerase, such as Klenow 
fragment and a primer corresponding to the 5' end of the ligated oligonucleotide. The primer is 
typically 20-25 bases in length. Methylated dCTP is used for second strand synthesis in order to 
protect internal EcoRI sites in the cDNA from digestion during the cloning process. 

Following second strand synthesis, the full-length cDNAs are cloned into a phagemid 
vector, such as pBlueScript™ (Stratagene). The ends of the full-length cDNAs are blunted with 
T4 DNA polymerase (Biolabs) and the cDNA is digested with EcoRI. Since methylated dCTP 
is used during cDNA synthesis, the EcoRI site present in the tag is the only hemi-methylated 



tag. 
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site; hence the only site susceptible to EcoRI digestion. In some instances, to facilitate 
subcloning, an Hind EQ adapter is added to the 3' end of full-length cDNAs. 

The full-length cDNAs are then size fractionated using either exclusion chromatography 
(AcA, Biosepra) or electrophoretic separation which yields 3 to 6 different fractions. The full- 
length cDNAs are then directionally cloned either into pBlueScript™ using either the EcoRI and 
Smal restriction sites or, when the Hind HI adapter is present in the full-length cDNAs, the 
EcoRI and Hind III restriction sites. The ligation mixture is transformed, preferably by 
electroporation, into bacteria, which are then propagated under appropriate antibiotic selection. 

Clones containing the oligonucleotide tag attached to full-length cDNAs are selected as 
follows. 

The plasmid cDNA libraries made as described above are purified (e.g. by a column 
available from Qiagen). A positive selection of the tagged clones is performed as follows. 
Briefly, in this selection procedure, the plasmid DNA is converted to single stranded DNA using 
phage Fl gene II endonuclease in combination with an exonuclease (Chang et al., Gene 127:95 
(1993)) such as exonuclease m or T7 gene 6 exonuclease. The resulting single stranded DNA is 
then purified using paramagnetic beads as described by Fry et al, Biotechniques 13: 124 (1992). 
Here the single stranded DNA is hybridized with a biotinylated oligonucleotide having a 
sequence corresponding to the 3' end of the oligonucleotide tag. Preferably, the primer has a 
length of 20-25 bases. Clones including a sequence complementary to the biotinylated 
oligonucleotide are selected by incubation with streptavidin coated magnetic beads followed by 
magnetic capture. After capture of the positive clones, the plasmid DNA is released from the 
magnetic beads and converted into double stranded DNA using a DNA polymerase such as 
ThermoSequenase™ (obtained from Amersham Pharmacia Biotech). Alternatively, protocols 
such as the Gene Trapper™ kit (Gibco BRL) can be used. The double stranded DNA is then 
transformed, preferably by electroporation, into bacteria. The percentage of positive clones 
having the 5' tag oligonucleotide is typically estimated to be between 90 and 98% from dot blot 
analysis. 

Following transformation, the libraries are ordered in microtiter plates and sequenced. 
The Arabidopsis library was deposited at the American Type Culture Collection on January 
7, 2000 as "E-coli liba 010600" under the accession numbe r PTA-1161 . 
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G. EXAMPLE 2: Southern hybridizations 

The SDFs of the invention can be used in Southern hybridizations as described above. 
The following describes extraction of DNA from nuclei of plant cells, digestion of the 
nuclear DNA and separation by length, transfer of the separated fragments to membranes, 
preparation of probes for hybridization, hybridization and detection of the hybridized probe. 

The procedures described herein can be used to isolate related polynucleotides or for 
diagnostic purposes. Moderate stringency hybridization conditions, as defined above, are 
described in the present example. These conditions result in detection of hybridization 
between sequences having at least 70% sequence identity. As described above, the 
hybridization and wash conditions can be changed to reflect the desired percenatge of 
sequence identity between probe and target sequences that can be detected. 

In the following procedure, a probe for hybridization is produced from two PCR 
reactions using two primers from genomic sequence of Arabidopsis thaliana. As described 
above, the particular template for generating the probe can be any desired template. 

The first PCR product is assessed to validate the size of the primer to assure it is of 
the expected size. Then the product of the first PCR is used as a template, with the same pair 
of primers used in the first PCR, in a second PCR that produces a labeled product used as the 
probe. 

Fragments detected by hybridization, or other bands of interest, can be isolated from 
gels used to separate genomic DNA fragments by known methods for further purification 
and/or characterization. 



Buffers for nuclear DNA extraction 
1. 10XHB 





1000 ml 




40 mM spermidine 


10.2 g 


Spermine (Sigma S-2876) and spermidine (Sigma 
S-2501) 


10 mM spermine 

r 


3.5 g 


Stabilize chromatin and the nuclear membrane 


0.1 MEDTA 
(disodium) 


37.2 g 


EDTA inhibits nuclease 
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0.1 MTris 


12.1 g 


Buffer 


0.8 M KC1 


59.6 g 


Adjusts ionic strength for stability of nuclei 



Adjust pH to 9.5 with 10 N NaOH. It appears that there is a nuclease present in 
leaves. Use of pH 9.5 appears to inactivate this nuclease. 



2. 2 M sucrose (684 g per 1000 ml) 

Heat about half the final volume of water to about 50°C. Add the sucrose slowly then 
bring the mixture to close to final volume; stir constantly until it has dissolved. Bring 
the solution to volume. 

3. Sarkosyl solution (lyses nuclear membranes) 



1000 ml 

N-lauroyl sarcosine (Sarkosyl) 20.0 g 

0.1 MTris 12.1 g 

0.04 M EDTA (Disodium) 14.9 g 

Adjust the pH to 9.5 after all the components are dissolved and bring up to the proper 
volume. 



4. 20% Triton X- 100 
80 ml Triton X-100 
320 ml lxHB (w/o p-ME and PMSF) 
Prepare in advance; Triton takes some time to dissolve 

A. Procedure 

1. Prepare IX "H" buffer (keep ice-cold during use) 



10XHB 



1000 ml 



100 ml 
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2 M sucrose 



250 ml a non-ionic osmoticum 



Water 



634 ml 



Added just before use: 



6-mercaptoethanol 



lOOmMPMSF* 



10 ml a protease inhibitor; protects 

nuclear membrane proteins 

1 ml inactivates nuclease by reducing 



disulfide bonds 



*100mMPMSF 

(phenyl methyl sulfonyl fluoride, Sigma P-7626) 
(add 0.0875 g to 5 ml 100% ethanol) 

Homogenize the tissue in a blender (use 300-400 ml of lxHB per blender). Be sure 
that you use 5-10 ml of HB buffer per gram of tissue. Blenders generate heat so be 
sure to keep the homogenate cold. It is necessary to put the blenders in ice 
periodically. 

Add the 20% Triton X-100 (25 ml per liter of homogenate) and gently stir on ice for 
20 min. This lyses plastid, but not nuclear, membranes. 

Filter the tissue suspension through several nylon filters into an ice-cold beaker. The 
first filtration is through a 250-micron membrane; the second is through an 85-micron 
membrane; the third is through a 50-micron membrane; and the fourth is through a 
20-micron membrane. Use a large funnel to hold the filters. Filtration can be sped up 
by gently squeezing the liquid through the filters. 

Centrifuge the filtrate at 1200 x g for 20 min. at 4°C to pellet the nuclei. 

Discard the dark green supernatant. The pellet will have several layers to it. One is 
starch; it is white and gritty. The nuclei are gray and soft. In the early steps, there 
may be a dark green and somewhat viscous layer of chloroplasts. 
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Wash the pellets in about 25 ml cold H buffer (with Triton X-100) and resuspend by 
swirling gently and pipetting. After the pellets are resuspended. 

Pellet the nuclei again at 1200 - 1300 x g. Discard the supernatant. 

Repeat the wash 3-4 times until the supernatant has changed from a dark green to a 
pale green. This usually happens after 3 or 4 resuspensions. At this point, the pellet 
is typically grayish white and very slippery. The Triton X-100 in these repeated steps 
helps to destroy the chloroplasts and mitochondria that contaminate the prep. 

Resuspend the nuclei for a final time in a total of 15 ml of H buffer and transfer the 
suspension to a sterile 125 ml Erlenmeyer flask. 

Add 15 ml, dropwise, cold 2% Sarkosyl, 0.1 M Tris, 0.04 M EDTA solution (pH 9.5) 
while swirling gently. This lyses the nuclei. The solution will become very viscous. 

Add 30 grams of CsCl and gently swirl at room temperature until the CsCl is in 
solution. The mixture will be gray, white and viscous. 

Centrifuge the solution at 1 1,400 x g at 4°C for at least 30 min. The longer this spin 
is, the firmer the protein pellicle. 

The result is typically a clear green supernatant over a white pellet, and (perhaps) 
under a protein pellicle. Carefully remove the solution under the protein pellicle and 
above the pellet. Determine the density of the solution by weighing 1 ml of solution 
and add CsCl if necessary to bring to 1.57 g/ml. The solution contains dissolved 
solids (sucrose etc) and the refractive index alone will not be an accurate guide to 
CsCl concentration. 

Add 20 |il of 10 mg/ml EtBr per ml of solution. 

Centrifuge at 184,000 x g for 16 to 20 hours in a fixed-angle rotor. 
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13. Remove the dark red supernatant that is at the top of the tube with a plastic transfer 
pipette and discard. Carefully remove the DNA band with another transfer pipette. 
The DNA band is usually visible in room light; otherwise, use a long wave UV light 
to locate the band. 

14. Extract the ethidium bromide with isopropanol saturated with water and salt. Once 
the solution is clear, extract at least two more times to ensure that all of the EtBr is 
gone. Be very gentle, as it is very easy to shear the DNA at this step. This extraction 
may take a while because the DNA solution tends to be very viscous. If the solution 
is too viscous, dilute it with TE. 

15. Dialyze the DNA for at least two days against several changes (at least three times) of 
TE (10 mM Tris, ImM EDTA, pH 8) to remove the cesium chloride. 

16. Remove the dialyzed DNA from the tubing. If the dialyzed DNA solution contains a 
lot of debris, centrifuge the DNA solution at least at 2500 x g for 10 min. and 
carefully transfer the clear supernatant to a new tube. Read the A260 concentration of 
the DNA. 

17. Assess the quality of the DNA by agarose gel electrophoresis (1% agarose gel) of the 
DNA. Load 50 ng and 100 ng (based on the OD reading) and compare it with known 
and good quality DNA. Undigested lambda DNA and a lambda-Hindlll-digested 
DNA are good molecular weight makers. 

Protocol for Digestion of Genomic DNA 



1 . The relative amounts of DNA for different crop plants that provide approximately a 
balanced number of genome equivalent is given in Table 3. Note that due to the size 
of the wheat genome, wheat DNA will be underrepresented. Lambda DNA provides 
a useful control for complete digestion. 



Protocol: 
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Precipitate the DNA by adding 3 volumes of 100% ethanol. Incubate at -20°C for at 
least two hours. Yeast DNA can be purchased and made up at the necessary 
concentration, therefore no precipitation is necessary for yeast DNA. 

Centrifuge the solution at 1 1,400 x g for 20 min. Decant the ethanol carefully (be 
careful not to disturb the pellet). Be sure that the residual ethanol is completely 
removed either by vacuum desiccation or by carefully wiping the sides of the tubes 
with a clean tissue. 

Resuspend the pellet in an appropriate volume of water. Be sure the pellet is fully 
resuspended before proceeding to the next step. This may take about 30 min. 

Add the appropriate volume of 10X reaction buffer provided by the manufacturer of 
the restriction enzyme to the resuspended DNA followed by the appropriate volume 
of enzymes. Be sure to mix it properly by slowly swirling the tubes. 

Set-up the lambda digestion-control for each DNA that you are digesting. 

o 

Incubate both the experimental and lambda digests overnight at 37 C. Spin down 
condensation in a microfuge before proceeding. 

After digestion, add 2 jil of loading dye (typically 0.25% bromophenol blue, 0.25% 
xylene cyanol in 15% Ficoll or 30% glycerol) to the lambda-control digests and load 
in 1% TPE-agarose gel (TPE is 90 mM Tris-phosphate, 2 mM EDTA, pH 8). If the 
lambda DNA in the lambda control digests are completely digested, proceed with the 
precipitation of the genomic DNA in the digests. 

Precipitate the digested DNA by adding 3 volumes of 100% ethanol and incubating in 
-20°C for at least 2 hours (preferably overnight). 

EXCEPTION: Arabidopsis and yeast DNA are digested in an appropriate volume; 
they don't have to be precipitated. 
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10. Resuspend the DNA in an appropriate volume of TE (e.g., 22 (il x 50 blots = 1 100 |il) 
and an appropriate volume of 10X loading dye (e.g., 2.4 |il x 50 blots = 120 jal). Be 
careful in pipetting the loading dye - it is viscous. Be sure you are pipetting the 
correct volume. 



Table 3 

10 Some guide points in digesting genomic DNA. 











ucnonie 


Amuuni 










T-*nni vfllpnt \c\ 0 lie 

1^L[U1 V did 11 lU Z, 


of DNA 








Size Relative to 


Arabidonsis DNA 


per blot 






Genome Size 


Arabidopsis 






i Pi 

r- : 


Species 










*?•* 

0 

n\ 


Arabidopsis 


120 Mb 


IX 


IX 


2^g 


Ul 


Brassica 


1,100 Mb 


9.2X 


0.54X 


10 jxg 


m 

a 


Corn 


2,800 Mb 


23.3X 


0.43X 


20 ng 


w 


Cotton 


2,300 Mb 


19.2X 


0.52X 


20 ng 




Oat 


11,300 Mb 


94X 


0.11X. 


20 ng 


■ZZl. 


Rice 


400 Mb 


3.3X 


0.75X 


5 Hg 


y. 


Soybean 


1,100 Mb 


9.2X 


0.54X 


10 ng 




Sugarbeet 


758 Mb 


6.3X 


0.8X 


10 fig 




Sweetclover 


1,100 Mb 


9.2X 


0.54X 


10 Jig 




Wheat 


16,000 Mb 


133X 


0.08X 


20 fig 




Yeast 


15 Mb 


0.1 2X 


IX 


0.25 jig 



Protocol for Southern Blot Analysis 

The digested DNA samples are electrophoresed in 1% agarose gels in lx TPE buffer. 
Low voltage; overnight separations are preferred. The gels are stained with EtBr and 
15 photographed. 
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1 . For blotting the gels, first incubate the gel in 0.25 N HC1 (with gentle shaking) for 
about 15 min. 

2. Then briefly rinse with water. The DNA is denatured by 2 incubations. Incubate 
(with shaking) in 0.5 M NaOH in 1.5 M NaCl for 15 min. 



3. The gel is then briefly rinsed in water and neutralized by incubating twice (with 
shaking) in 1.5 M Tris pH 7.5 in 1.5 M NaCl for 15 min. 

4. A nylon membrane is prepared by soaking it in water for at least 5 min, then in 6X 
SSC for at least 15 min. before use. (20x SSC is 175.3 g NaCl, 88.2 g sodium citrate 
per liter, adjusted to pH 7.0.) 

5. The nylon membrane is placed on top of the gel and all bubbles in between are 
removed. The DNA is blotted from the gel to the membrane using an absorbent 
medium, such as paper toweling and 6x SCC buffer. After the transfer, the membrane 
may be lightly brushed with a gloved hand to remove any agarose sticking to the 
surface. 

o 

6. The DNA is then fixed to the membrane by UV crosslinking and baking at 80 C. The 
membrane is stored at 4°C until use. 

B. Protocol for PCR Amplification of Genomic Fragments in Arabidopsis 



Amplification procedures : 

1 . Mix the following in a 0.20 ml PCR tube or 96-well PCR plate: 



Volume 


Stock 


Final Amount or Cone. 


0.5 jil 


~ 10 ng/iil genomic DNA 1 


5ng 


2.5 \i\ 


1 OX PCR buffer 


20 mM Tris, 50 mM KC1 



1 Arabidopsis DNA is used in the present experiment, but the procedure is a general one. 
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0.75 ul 


50 mM MgCl 2 


1.5 mM 


1»1 


10 pmol/ul Primer 1 (Forward) 


1 0 pmol 


lul 


10 pmol/ul Primer 2 (Reverse) 


10 pmol 


0.5 ul 


5 mM dNTPs 


0.1 mM 


0.1 pi 


5 units/ul Platinum Taq™ (Life 
Technologies, Gaithersburg, MD) 
DNA Polymerase 


1 units 


(to 25 ul) 


Water 





2. The template DNA is amplified using a Perkin Elmer 9700 PCR machine: 



1) 94 C for 10 min. followed by 



2) 




3) 




4) 


5 cycles: 




5 cycles: 




25 cycles: 


94 °C- 


30 sec 


94 °C- 


30 sec 


94 °C - 30 sec 


62 °C - 


30 sec 


58 °C- 


30 sec 


53 °C- 30 sec 


72 °C- 


3 min 


72 °C- 


3 min 


72 °C- 3 min 



5) 72 C for 7 min. Then the reactions are stopped by chilling to 4 C. 



The procedure can be adapted to a multi-well format if necessary. 
Quantification and Dilution of PCR Products: 

1. The product of the PCR is analyzed by electrophoresis in a 1% agarose gel. A 

linearized plasmid DNA can be used as a quantification standard (usually at 50, 100, 
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200, and 400 ng). These will be used as references to approximate the amount of 
PCR products. Hindlll-digested Lambda DNA is useful as a molecular weight 
marker. The gel can be run fairly quickly; e.g., at 100 volts. The standard gel is 
examined to determine that the size of the PCR products is consistent with the 
expected size and if there are significant extra bands or smeary products in the PCR 
reactions. 

2. The amounts of PCR products can be estimated on the basis of the plasmid standard. 

3. For the small number of reactions that produce extraneous bands, a small amount of 
DNA from bands with the correct size can be isolated by dipping a sterile 10-(il tip 
into the band while viewing though a UV Transilluminator. The small amount of 
agarose gel (with the DNA fragment) is used in the labeling reaction. 

C. Protocol for PCR-DIG-Labeling of DNA 
Solutions : 

Reagents in PCR reactions (diluted PCR products, 10X PCR Buffer, 50 mM MgCl 2 , 5 
U/jil Platinum Taq Polymerase, and the primers) 

10X dNTP + DIG-1 1-dUTP [1 :5]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1.65 
mM dTTP, 0.35 mM DIG-1 1-dUTP) 

10X dNTP + DIG-1 1-dUTP [1:10]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1.81 
mM dTTP, 0.19 mM DIG-1 1-dUTP) 

10X dNTP + DIG-1 1-dUTP [1:15]: (2 mM dATP, 2 mM dCTP, 2 mM dGTP, 1.875 
mM dTTP, 0.125 mM DIG-1 1-dUTP) 

TE buffer (10 mM Tris, 1 mM EDTA, pH 8) 

Maleate buffer: In 700 ml of deionized distilled water, dissolve 1 1.61 g maleic acid 
and 8.77 g NaCl. Add NaOH to adjust the pH to 7.5. Bring the volume to 1 L. Stir 
for 15 min. and sterilize. 
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10% blocking solution: In 80 ml deionized distilled water, dissolve 1.1 6g maleic 
acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent 

o 

powder (Boehringer Mannheim, Indianapolis, EN, Cat. no. 1096176). Heat to 60 C 
while stirring to dissolve the powder. Adjust the volume to 100 ml with water. Stir 
and sterilize. 

1% blocking solution: Dilute the 10% stock to 1% using the maleate buffer. 

Buffer 3 (100 mM Tris, 100 mM NaCl, 50 mM MgCl 2 , pH9.5). Prepared from 
autoclaved solutions of 1M Tris pH 9.5, 5 M NaCl, and 1 M MgCl 2 in autoclaved 
distilled water. 
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Procedure: 



PCR reactions are performed in 25 ul volumes containing: 



PCR buffer 
MgCl 2 

10X dNTP + DIG-1 1-dUTP 
Platinum Taq™ Polymerase 
10 pg probe DNA 
10 pmol primer 1 



IX 

1.5 mM 

IX (please see the note below) 
1 unit 



Note: 



IPX dNTP + DIG-1 1-dUTP (1 :5) 



10X dNTP + DIG-1 1-dUTP (1:10) 
1 OX dNTP + DIG- 1 1 -dUTP (1:15) 



Use for : 
< 1 kb 



1 kbto 1.8 kb 
> 1.8 kb 



The PCR reaction uses the following amplification cycles: 



1) 


94 C for 10 min. 








2) 




3) 




4) 


5 cycles: 




5 cycles: 




25 cycles: 


95°C - 


30 sec 


95°C - 


30 sec 


95°C - 30 sec 


61°C - 


1 min 


59°C - 


1 min 


51°C - 1 min 


73°C - 


5 min 


75°C - 


5 min 


73°C - 5 min 



o o 

5) 72 C for 8 min. The reactions are terminated by chilling to 4 C (hold). 

3. The products are analyzed by electrophoresis- in a 1% agarose gel, comparing to an 
aliquot of the unlabelled probe starting material. 

4. The amount of DIG-labeled probe is determined as follows: 
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Make serial dilutions of the diluted control DNA in dilution buffer (TE: 10 mM Tris 
and 1 mM EDTA, pH 8) as shown in the following table: 



DIG-labeled control 
DNA starting cone. 


Stepwise Dilution 


Final Cone. (Dilution 
Name) 


5 ng/|il 


1 |Ltl in 49 \xl TE 


lOOpg/^l (A) 


100 pg/nl (A) 


25 jil in 25 |il TE 


50 pg/jil (B) 


50pg/jil(B) 


25 \il in 25 \i\ TE 


25 pg/nl(C) 


25 pg/nl (C) 


20 nl in 30 \il TE 


10pg/nl(D) 



a. Serial deletions of a DIG-labeled standard DNA ranging from 100 pg to 10 pg 
are spotted onto a positively charged nylon membrane, marking the membrane 
lightly with a pencil to identify each dilution. 



b. Serial dilutions (e.g., 1:50, 1:2500, 1:10,000) of the newly labeled DNA probe 
are spotted. 

c. The membrane is fixed by UV crosslinking. 

d. The membrane is wetted with a small amount of maleate buffer and then 
incubated in 1% blocking solution for 15 min at room temp. 

e. The labeled DNA is then detected using alkaline phosphatase conjugated anti- 
DIG antibody (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) and 
an NBT substrate according to the manufacture's instruction. 

f. Spot intensities of the control and experimental dilutions are then compared to 
estimate the concentration of the PCR-DIG-labeled probe. 
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D. Prehybridization and Hybridization of Southern Blots 
Solutions: 



100% Formamide purchased from Gibco 

20X SSC (IX = 0.15 M NaCl, 0.015 M Na 3 citrate) 

perL: 175 gNaCl 

87.5gNa 3 citrate-2H 2 0 



20% Sarkosyl (N-lauroyl-sarcosine) 
20% SDS (sodium dodecyl sulphate) 



10%o Blocking Reagent: In 80 ml deionized distilled water, dissolve 1.16 g maleic 
acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent 
powder. Heat to 60°C while stirring to dissolve the powder. Adjust the volume 
to 100 ml with water. Stir and sterilize. 



Prehybridization Mix: 



Final 

Concentration 


Components 


Volume 
(per 100 ml) 


Stock 


50% 


Formamide 


50 ml 


100% 


5X 


SSC 


25 ml 


20X 


0.1% 


Sarkosyl 


0.5 ml 


20% 


0.02% 


SDS 


0.1 ml 


20% 


2% 


Blocking Reagent 


20 ml 


10% 




Water 


4.4 ml 





General Procedures : 

1 . Place the blot in a heat-sealable plastic bag and add an appropriate volume of 

prehybridization solution (30 ml/1 00cm 2 ) at room temperature. Seal the bag with a 
heat sealer, avoiding bubbles as much as possible. Lay down the bags in a large 
plastic tray (one tray can accommodate at least 4-5 bags). Ensure that the bags are 
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lying flat in the tray so that the prehybridization solution is evenly distributed 
throughout the bag. Incubate the blot for at least 2 hours with gentle agitation using a 
waver shaker. 

2. Denature DIG-labeled DNA probe by incubating for 10 min. at 98°C using the PCR 
machine and immediately cool it to 4°C. 

3. Add probe to prehybridization solution (25 ng/ml; 30 ml = 750 ng total probe) and 
mix well but avoid foaming. Bubbles may lead to background. 

4. Pour off the prehybridization solution from the hybridization bags and add new 
prehybridization and probe solution mixture to the bags containing the membrane. 

5. Incubate with gentle agitation for at least 16 hours. 

6. Proceed to medium stringency post-hybridization wash: 

Three times for 20 min. each with gentle agitation using IX SSC, 1% SDS at 60°C. 

All wash solutions must be prewarmed to 60°C. Use about 100 ml of wash solution 
per membrane. 

To avoid background keep the membranes fully submerged to avoid drying in spots; 
agitate sufficiently to avoid having membranes stick to one another. 

7. After the wash, proceed to immunological detection and CSPD development. 
E. Procedure for Immunological Detection with CSPD 



Solutions: 



Buffer 1: 



Maleic acid buffer (0.1 M maleic acid, 0.15 M NaCl; 
adjusted to pH 7.5 with NaoH) 
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Maleic acid buffer with 0.3% (v/v) Tween 20. 



Blocking stock solution 



10% blocking reagent in buffer 1 . Dissolve (10X 
concentration): blocking reagent powder (Boehringer 
Mannheim, Indianapolis, IN, cat. no. 1096176) by 
constantly stirring on a 65°C heating block or heat in a 

o 

microwave, autoclave and store at 4 C. 



Buffer 2 



(IX blocking solution): 



Dilute the stock solution 1 : 1 0 in Buffer 1 . 



Detection buffer: 



0.1 MTris, 0.1 MNaCl, pH 9.5 



Procedure : 

1. After the post-hybridization wash the blots are briefly rinsed (1-5 min.) in the maleate 
washing buffer with gentle shaking. 

2. Then the membranes are incubated for 30 min. in Buffer 2 with gentle shaking. 

3. Anti-DIG-AP conjugate (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) 
at 75 mU/ml (1 : 10,000) in Buffer 2 is used for detection. 75 ml of solution can be 
used for 3 blots. 

4. The membrane is incubated for 30 min. in the antibody solution with gentle shaking. 

5. The membrane are washed twice in washing buffer with gentle shaking. About 250 
mis is used per wash for 3 blots. 

6. The blots are equilibrated for 2-5 min in 60 ml detection buffer. 

7. Dilute CSPD (1 :200) in detection buffer. (This can be prepared ahead of time and 
stored in the dark at 4°C). 

The following steps must be done individually. Bags (one for detection and one for 
exposure) are generally cut and ready before doing the following steps. 

8. The blot is carefully removed from the detection buffer and excess liquid removed 
without drying the membrane. The blot is immediately placed in a bag and 1 .5 ml of 
CSPD solution is added. The CSPD solution can be spread over the membrane. 
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Bubbles present at the edge and on the surface of the blot are typically removed by 
gentle rubbing. The membrane is incubated for 5 min. in CSPD solution. 

9. Excess liquid is removed and the membrane is blotted briefly (DNA side up) on 
Whatman 3 MM paper. Do not let the membrane dry completely. 

o 

10. Seal the damp membrane in a hybridization bag and incubate for 10 min at 37 C to 
enhance the luminescent reaction. 

1 1 . Expose for 2 hours at room temperature to X-ray film. Multiple exposures can be 
taken. Luminescence continues for at least 24 hours and signal intensity increases 
during the first hours. 

EXAMPLE 3: MICROARRAY EXPERIMENTS AND RESULTS 



Example 3: MICROARRAY EXPERIMENTS AND RESULTS 
1. Sample Tissue Preparation 

(a) Roots 

Seeds of Arabidopsis thaliana (Ws) were sterilized in full strength bleach for less than 
5 min., washed more than 3 times in sterile distilled deionized water and plated on MS agar 
plates. The plates were placed at 4°C for 3 nights and then placed vertically into a growth 
chamber having 16 hr light/8 hr dark cycles, 23 °C, 70% relative humidity and ~1 1,000 LUX. 
After 2 weeks, the roots were cut from the agar, flash frozen in liquid nitrogen and stored at - 
80°C.(EXPT REP: 108439 and 108434) 



(b) Root Hairless mutants 

Plants mutant at the rhl gene locus lack root hairs. This mutation is maintained as a 
heterozygote. 

Seeds of Arabidopsis thaliana (Landsberg erecta) mutated at the rhl gene locus were 
sterilized using 30% bleach with 1 ul/ml 20% Triton -X 100 and then vernalized at 4°C for 3 
days before being plated onto GM agar plates. Plates were placed in growth chamber with 16 
hr light/8 hr. dark, 23°C, 14,500-15,900 LUX, and 70% relative humidity for germination and 
growth. 
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After 7 days, seedlings were inspected for root hairs using a dissecting microscope. 
Mutants were harvested and the cotyledons removed so that only root tissue remained. 
Tissue was then flash frozen in liquid nitrogen and stored at -80C. (EXPT REP: 108433) 

Arabidopsis thaliana (Landsberg erecta) seedlings grown and prepared as above were 
used as controls. (EXPT REP: 108433) 

Alternatively, seeds of Arabidopsis thaliana (Landsberg erecta), heterozygous for the 
rhll (root hairless) mutation, were surface-sterilized in 30% bleach containing 0.1% Triton 
X-100 and further rinsed in sterile water. They were then vernalized at 4° C for 4 days before 
being plated onto MS agar plates. The plates were maintained in a growth chamber at 24°C 
with 16 hr light/8 hr dark for germination and growth. After 10 days, seedling roots that 
expressed the phenotype (i.e. lacking root hairs) were cut below the hypocotyl junction, 
frozen in liquid nitrogen and stored at -80°C. Those seedlings with the normal root 
phenotype (heterozygous or wt) were collected as described for the mutant and used as 
controls. 

(c) Rosette Leaves, Stems, and Siliques Arabidopsis thaliana (Ws) seed was 
vernalized at4° C for 3 days before sowing in Metro-mix soil type 350. Flats were placed in a 
growth chamber having 16 hr light/8 hr dark, 80% relative humidity, 23°C and 13,000 LUX 
for germination and growth. After 3 weeks, rosette leaves, stems, and siliques (see EXPT 
REP: 108436, 108437 and 108438) were harvested, flash frozen in liquid nitrogen and stored 
at -80°C until use. After 4 weeks, siliques (<5mm, 5-10 mm and >10 mm) were harvested, 
flash frozen in liquid nitrogen and stored at -80°C until use. 5 week old whole plants (used as 
controls) were harvested, flash frozen in liquid nitrogen and kept at -80°C until RNA was 
isolated. 

(d) Trichomes 

Arabidopsis thaliana (Colombia glabrous) inflorescences were used as a control and 
CS8143 (hairy inflorescence ecotype) inflorescences, having increased trichomes, were used 
as the experimental sample. 

Approximately 10 (al of each type of seed was sown on a flat of 350 soil (containing 
0.03% marathon) and vernalized at 4° C for 3 days. Plants were then grown at room 
temperature under florescent lighting. Young inflorescences were collected at 30 days for the 
control plants and 37 days for the experimental plants. Each inflorescence was cut into one- 
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half inch (1/2") pieces, flash frozen in liquid nitrogen and stored at -80°C until RNA was 
isolated. 



(e) Germination 

Arabidopsis thaliana seeds (ecotype Ws) were sterilized in bleach and rinsed with 
sterile water. The seeds were placed in 100mm petri plates containing soaked autoclaved 
filter paper. Plates were foil-wrapped and left at 4°C for 3 nights to vernalize. After cold 
treatment, the foil was removed and plates were placed into a growth chamber having 16 hr 
light/8 hr dark cycles, 23 °C, 70% relative humidity and —11,000 lux. Seeds were collected 1 
d (EXPT REP: 108461), 2 d (EXPT REP: 108462), 3 d (EXPT REP: 108463) and 4 d (EXPT 
REP: 108464) later, flash frozen in liquid nitrogen and stored at -80°C until RNA was 
isolated. 

(f) Shoot Apical Meristem 

Arabidopsis thaliana (Landsberg erecta) plants mutant at the stm gene locus lack 
shoot meristems, produce aerial rosettes, have a reduced number of flowers per inflorescence, 
as well as a reduced number of petals, stamens and carpels, and is female sterile. This 
mutation is maintained as a heterozygote. 

Seeds of Arabidopsis thaliana (Landsberg erecta) mutated at the stm locus were 
sterilized using 30% bleach with 1 ul/ml 20% Triton -XI 00. The seeds were vernalized at 
4°C for 3 days before being plated onto GM agar plates. Half were then put into a 22°C, 24 hr 
light growth chamber and half in a 24°C 16 hr light/8 hr dark growth chamber having 14,500- 
15,900 LUX, and 70% relative humidity for germination and growth. 

After 7 days, seedlings were examined for leaf primordia using a dissecting 
microscope. Presence of leaf primordia indicated a wild type phenotype. Mutants were 
selected based on lack of leaf primordia. Mutants were then harvested and hypocotyls 
removed leaving only tissue in the shoot region. Tissue was then flash frozen in liquid 
nitrogen and stored at -80°C. 

Control tissue was isolated from 5 day old Landsberg erecta seedlings grown in the same 
manner as above. Tissue from the shoot region was harvested in the same manner as the stm 
tissue, but only contained material from the 24°C, 16 hr light/8 hr dark long day cycle growth 
chamber. (EXPT REP: 108453) 
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Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 8 days. Seedlings were 
carefully removed from the sand and the outer layers of leaf shealth removed. About 2 mm 
sections were cut and flash frozen in liquid nitrogen prior to storage at -80°C. The tissues 
above the shoot apices (-1 cm long) were cut, treated as above and used as control tissue. 



(g) Abscissic acid (ABA) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 
4°C for 4 days to vernalize. They were then transferred to a growth chamber having grown 
16 hr light/8 hr dark, 13,000 LUX, 70% humidity, and 20°C and watered twice a week with 1 
L of IX Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200- 
250 mis of 100 ABA in a 0.02% solution of the detergent Silwet L-77. Whole seedlings, 
including roots, were harvested within a 15 to 20 minute time period at 1 hr and 6 hr after 
treatment, flash- frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 100 jaM ABA for 
treatment. Control plants were treated with water. After 6 hr and 24 hr, aerial and root tissues 
were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 



(h) Auxin Responsive 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 4°C 
for 4 days to vernalize. They were then transferred to a growth chamber having 16 hr light/8 
hr dark, 13,000 LUX, 70% humidity, 20°C and watered twice a week with 1 L of IX 
Hoagland's solution (recipe recited in Feldmann et al., (1987) Mol. Gen. Genet. 208: 1-9 and 
described as complete nutrient solution). Approximately 1,000 14 day old plants were 
spayed with 200-250 mis of 100 NAA in a 0.02% solution of the detergent Silwet L-77. 
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Aerial tissues (everything above the soil line) were harvested within a 15 to 20 minute time 
period 1 hr and 6 hrs after treatment, flash-frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 100 |iM NAA for 
treatment. Control plants were treated with water. After 6 hr and 24 hr, aerial and root tissues 
were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(i) Cytokinin 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 
4°C for 4 days to vernalize. They were then transferred to a growth chamber having 16 hr 
light/8 hr dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 
1 L of IX Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 
200-250 mis of 100 \xM BA in a 0.02% solution of the detergent Silwet L-77. Aerial tissues 
(everything above the soil line) were harvested within a 15 to 20 minute time period 1 hr and 
6 hrs after treatment, flash-frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 100 |iM BA for treatment. 
Control plants were treated with water. After 6 hr, aerial and root tissues were separated and 
flash frozen in liquid nitrogen prior to storage at -80°C. 

(j) Brassinosteroid Responsive 

Two separate experiments were performed, one with epi-brassinolide and one with the 
brassinosteroid biosynthetic inhibitor brassinazole. 

In the epi-brassinolide experiments, seeds of wild-type Arabidopsis thaliana (ecotype 
Wassilewskija) and the brassinosteroid biosynthetic mutant dwf4-l were sown in trays and 
left at 4°C for 4 days to vernalize. They were then transferred to a growth chamber having 16 
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hr light/8 hr dark, 1 1,000 LUX, 70% humidity and 22°C temperature. Four week old plants 
were spayed with a 1 jaM solution of epi-brassinolide and shoot parts (unopened floral 
primordia and shoot apical meristems) harvested three hours later. Tissue was flash-frozen in 
liquid nitrogen and stored at -80°C.(EXPT REP 108480) 

In the brassinazole experiments, seeds of wild-type Arabidopsis thaliana (ecotype 
Wassilewskija) were grown as described above. Four week old plants were spayed with a 1 
\\M solution of brassinazole and shoot parts (unopened floral primordia and shoot apical 
meristems) harvested three hours later. Tissue was flash-frozen in liquid nitrogen and stored 
at -80°C.(EXPT REP 108481) 

In addition to the spray experiments, tissue was prepared from two different mutants; 
(1) a dwfX-l knock out mutant (EXPT REP: 108478) and (2) a mutant overexpressing the 
dwf4-\ gene (EXPT REP: 108479). 

Seeds of wild-type Arabidopsis thaliana (ecotype Wassilewskija) and of the dwfA-\ 
knock out and overexpressor mutants were sown in trays and left at 4°C for 4 days to 
vernalize. They were then transferred to a growth chamber having 16 hr light/8 hr dark, 
1 1,000 LUX, 70°/o humidity and 22°C temperature. Tissue from shoot parts (unopened floral 
primordia and shoot apical meristems) was flash-frozen in liquid nitrogen and stored at - 



Another experiment was completed with seeds of Arabidopsis thaliana (ecotype 
Wassilewskija) were sown in trays and left at 4°C for 4 days to vernalize. They were then 
transferred to a growth chamber. Plants were grown under long-day (16 hr light: 8 hr. dark) 
conditions, 13,000 LUX light intensity, 70% humidity, 20°C temperature and watered twice a 
week with 1 L IX Hoagland's solution(recipe recited in Feldmann et al., (1987) Mol. Gen. 
Genet. 208: 1-9 and described as complete nutrient solution). Approximately 1,000 14 day 
old plants were spayed with 200-250 mis of 0.1 |iM Epi-Brassinolite in 0.02% solution of the 
detergent Silwet L-77. At 1 hr. and 6 hrs. after treatment aerial tissues were harvested within 
a 15 to 20 minute time period and flash-frozen in liquid nitrogen. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 0.1 \iM epi-brassinolide 



80°C. 
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for treatment. Control plants were treated with distilled deionized water. After 24 hr, aerial 
and root tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 



(k) Gibberillic Acid 

Seeds of Arabidops is thaliana (ecotype Wassilewskija) were sown in trays and 
left at 4°C for 4 days to vernalize. They were then transferred to a growth chamber having 16 
hr light/8 hr. dark, 13,000 LUX, 70% humidity, 20°C and watered twice a week with 1 L of 
IX Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200-250 
mis of 100 gibberillic acid in a 0.02% solution of the detergent Silwet L-77. At 1 hr. and 
6 hrs. after treatment, aerial tissues (everything above the soil line) were harvested within a 
15 to 20 minute time period, flash-frozen in liquid nitrogen and stored at -80°C. 

Alternatively, seeds of Arabidopsis thaliana (ecotype Ws) were sown in Metro-mix 
soil type 350 and left at 4°C for 3 days to vernalize. They were then transferred to a growth 
chamber having 16 hr light/8 hr dark, 13,000 LUX, 80% humidity, 20°C temperature and 
watered every four days with 1.5 L water. 14 days after germination, plants were sprayed 
with 100 |iM gibberillic acid or with water. Aerial tissues were harvested 1 hr (EXPT REP: 
108484), 6 hrs (EXPT REP: 108485), 12 hrs (EXPT REP: 108486), and 24 hrs post- 
treatment, flash frozen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 100 jaM gibberillic acid 
for treatment. Control plants were treated with water. After 1 hr, 6 hr and 12 hr, aerial and 
root tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(I) Nitrogen: High to Low 

Wild type Arabidopsis thaliana seeds (ecotpye Ws) were surface sterilized with 30% 
Clorox, 0.1% Triton X-100 for 5 minutes. Seeds were then rinsed with 4-5 exchanges of 
sterile double distilled deionized water. Seeds were vernalized at 4°C for 2-4 days in 
darkness. After cold treatment, seeds were plated on modified IX MS media (without 
NH4NO3 or KNO3), 0.5% sucrose, 0.5g/L MES pH5.7, 1% phytagar and supplemented with 
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KNO3 to a final concentration of 60 mM (high nitrate modified IX MS media). Plates were 
then grown for 7 days in a Percival growth chamber at 22°C with 16 hr. light/8 hr dark. 

Germinated seedlings were then transferred to a sterile flask containing 50 mL of high 
nitrate modified IX MS liquid media. Seedlings were grown with mild shaking for 3 
additional days at 22°C in 16 hr. light/8 hr dark (in a Percival growth chamber) on the high 
nitrate modified IX MS liquid media. 

After three days of growth on high nitrate modified IX MS liquid media, seedlings 
were transferred either to a new sterile flask containing 50 mL of high nitrate modified IX 
MS liquid media or to low nitrate modified IX MS liquid media (containing 20 DM KNO3). 
Seedlings were grown in these media conditions with mild shaking at 22°C in 16 hr light/ 8 hr 
dark for the appropriate time points and whole seedlings harvested for total RNA isolation via 
the Trizol method (LifeTech.). The time points used for the microarray experiments were 10 
min. (EXPT REP: 108454) and 1 hour (EXPT REP: 108455) time points for both the high 
and low nitrate modified IX MS media. 

Alternatively, seeds that were surface sterilized in 30% bleach containing 0.1% Triton 
X-100 and further rinsed in sterile water, were planted on MS agar, (0.5% sucrose) plates 
containing 50 mM KNO3 (potassium nitrate). The seedlings were grown under constant light 
(3500 LUX) at 22°C. After 12 days, seedlings were transferred to MS agar plates containing 
either ImM KN0 3 or 50 mM KN0 3 . Seedlings transferred to agar plates containing 50 mM 
KNO3 were treated as controls in the experiment. Seedlings transferred to plates with ImM 
KNO3 were rinsed thoroughly with sterile MS solution containing 1 mM KNO3. There were 
ten plates per transfer. Root tissue was collected and frozen in 15 mL Falcon tubes at various 
time points which included 1 hour, 2 hours, 3 hours, 4 hours, 6 hours, 9 hours, 12 hours, 16 
hours, and 24 hours. 

Maize 35A19 Pioneer hybrid seeds were sown on flats containing sand and grown in a 
Conviron growth chamber at 25°C, 16 hr light/8 hr dark, -13,000 LUX and 80% relative 
humidity. Plants were watered every three days with double distilled deionized water. 
Germinated seedlings are allowed to grow for 10 days and were watered with high nitrate 
modified IX MS liquid media (see above). On day 11, young corn seedlings were removed from 
the sand (with their roots intact) and rinsed briefly in high nitrate modified IX MS liquid media. 
The equivalent of half a flat of seedlings were then submerged (up to their roots) in a beaker 
containing either 500 mL of high or low nitrate modified IX MS liquid media (see above for 
details). 
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At appropriate time points, seedlings were removed from their respective liquid 
media, the roots separated from the shoots and each tissue type flash frozen in liquid nitrogen 
and stored at -80°C. This was repeated for each time point. Total RNA was isolated using the 
Trizol method (see above) with root tissues only. 

Corn root tissues isolated at the 4 hr and 16 hr time points were used for the 
microarray experiments. Both the high and low nitrate modified IX MS media were used. 

(m) Nitrogen: Low to High 

Arabidopsis thaliana ecotype Ws seeds were sown on flats containing 4 L of a 1 :2 
mixture of Grace Zonolite vermiculite and soil. Flats were watered with 3 L of water and 
vernalized at 4°C for five days. Flats were placed in a Conviron growth chamber having 16 hr 
light/8 hr dark at 20°C, 80% humidity and 17,450 LUX. Flats were watered with 
approximately 1.5 L of water every four days. Mature, bolting plants (24 days after 
germination) were bottom treated with 2 L of either a control (100 mM mannitol pH 5.5) or 
an experimental (50 mM ammonium nitrate, pH 5.5) solution. Roots, leaves and siliques were 
harvested separately 30, 120 and 240 minutes after treatment, flash frozen in liquid nitrogen 
and stored at -80°C. 

Hybrid maize seed (Pioneer hybrid 35A19) were aerated overnight in deionized water. 
Thirty seeds were plated in each flat, which contained 4 liters of Grace zonolite vermiculite. 
Two liters of water were bottom fed and flats were kept in a Conviron growth chamber with 
16 hr light/8 hr dark at 20°C and 80% humidity. Flats were watered with 1 L of tap water 
every three days. Five day old seedlings were treated as described above with 2 L of either a 
control (100 mM mannitol pH 6.5) solution or 1 L of an experimental (50 mM ammonium 
nitrate, pH 6.8) solution. Fifteen shoots per time point per treatment were harvested 10, 90 
and 180 minutes after treatment, flash frozen in liquid nitrogen and stored at -80°C. 

Alternatively, seeds of Arabidopsis thaliana (ecotype Wassilewskija) were left at 4°C 
for 3 days to vernalize. They were then sown on vermiculite in a growth chamber having 16 
hours light/8 hours dark, 12,000-14,000 LUX, 70% humidity, and 20°C. They were bottom- 
watered with tap water, twice weekly. Twenty- four days old plants were sprayed with either 
water (control) or 0.6% ammonium nitrate at 4 jaL/cm 2 of tray surface. Total shoots and 
some primary roots were cleaned of vermiculite, flash-frozen in liquid nitrogen and stored at 



-80°C. 
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(n) Methyl Jasmonate 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 
4°C for 4 days to vernalize before being transferred to a growth chamber having 16 hr light/8 
hr. dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 1 L 
of a IX Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200- 
250 mis of 0.001% methyl jasmonate in a 0.02% solution of the detergent Silwet L-77. At 1 
hr and 6 hrs after treatment, whole seedlings, including roots, were harvested within a 15 to 
20 minute time period, flash- frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1-liter beakers with 0.001% methyl jasmonate 
for treatment. Control plants were treated with water. After 24 hr, aerial and root tissues were 
separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(o) Salicylic Acid 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in trays and left at 
4°C for 4 days to vernalize before being transferred to a growth chamber having 16 hr light/8 
hr. dark, 13,000 LUX, 70% humidity, 20°C temperature and watered twice a week with 1 L 
of a IX Hoagland's solution. Approximately 1,000 14 day old plants were spayed with 200- 
250 mis of 5 mM salicylic acid (solubilized in 70% ethanol) in a 0.02% solution of the 
detergent Silwet L-77. At 1 hr and 6 hrs after treatment, whole seedlings, including roots, 
were harvested within a 15 to 20 minute time period flash-frozen in liquid nitrogen and stored 
at -80°C. 

Alternatively, seeds of wild-type Arabidopsis thaliana (ecotype Columbia) and 
mutant CS3726 were sown in soil type 200 mixed with osmocote fertilizer and Marathon 
insecticide and left at 4°C for 3 days to vernalize. Flats were incubated at room temperature 
with continuous light. Sixteen days post germination plants were sprayed with 2 mM SA, 
0.02% SilwettL-77 or control solution (0.02% SilwettL-77. Aerial parts or flowers were 
harvested 1 hr (EXPT REP: 108471 and 108472), 4 hr (EXPT REP: 108469 and 108470), 6 
hr (EXPT REP: 108440,) 24 hr (EXPT REP: 108443, 107953 and 107960) and 3 weeks 
(EXPT REP: 108475, 108476) post-treatment flash frozen and stored at -80°C. 
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Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 2 mM SA for treatment. 
Control plants were treated with water. After 12 hr and 24 hr, aerial and root tissues were 
separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(p) Wounding 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 70% humidity and 20°C. After 14 days, the leaves were wounded 
with forceps. Aerial tissues were harvested 1 hour and 6 hours after wounding. Aerial 
tissues from unwounded plants served as controls. Tissues were flash^frozen in liquid 
nitrogen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
wounded (one leaf nicked by scissors) and placed in 1 -liter beakers of water for treatment. 
Control plants were treated not wounded. After 1 hr and 6 hr aerial and root tissues were 
separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(q) Drought stress 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in pots and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 150,000-160,000 LUX, 20°C and 70% humidity. After 14 days, aerial tissues were cut 
and left to dry on 3MM Whatman paper in a petri-plate for 1 hour and 6 hours. Aerial tissues 
exposed for 1 hour and 6 hours to 3 MM Whatman paper wetted with IX Hoagland's solution 
served as controls. Tissues were harvested, flash- frozen in liquid nitrogen and stored at - 



Alternatively, Arabidopsis thaliana (Ws) seed was vernalized at 4° C for 3 days 
before sowing in Metromix soil type 350. Flats were placed in a growth chamber with 23°C, 
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16 hr light/8 hr. dark, 80% relative humidity, -1 3,000 LUX for germination and growth. 
Plants were watered with 1-1.5 L of water every four days. Watering was stopped 16 days 
after germination for the treated samples, but continued for the control samples. Rosette 
leaves and stems (EXPT REP 108477, 108482 and 108483), flowers (see EXPT REP: 
108473, 108474) and siliques were harvested 2 d, 3 d, 4 d, 5 d, 6 d and 7 d (EXPT REP: 

108473) after watering was stopped. Tissue was flash frozen in liquid nitrogen and kept at - 
80 °C until RNA was isolated. Flowers and siliques were also harvested on day 8 from plants 
that had undergone a 7 d drought treatment followed by 1 day of watering (EXPT REP: 

108474) . Control plants (whole plants) were harvested after 5 weeks, flash frozen in liquid 
nitrogen and stored as above. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in empty 1 -liter beakers at room temperature for 
treatment. Control plants were placed in water. After 1 hr, 6 hr, 12 hr and 24 hr aerial and 
root tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(r) Osmotic stress 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 20°C, and 70% humidity. After 14 days, the aerial tissues were 
cut and placed on 3 MM Whatman paper in a petri-plate wetted with 20% PEG 
(polyethylene glycol-M r 8,000) in IX Hoagland's solution. Aerial tissues on 3 MM Whatman 
paper containing IX Hoagland's solution alone served as the control Aerial tissues were 
harvested at 1 hour and 6 hours after treatment, flash-frozen in liquid nitrogen and stored at 
-80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 20% PEG (polyethylene 
glycol-M r 8,000) for treatment. Control plants were treated with water. After 1 hr and 6 hr 
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aerial and root tissues were separated and flash frozen in liquid nitrogen prior to storage at - 



Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 150mM NaCl for 
treatment. Control plants were treated with water. After 1 hr, 6hr, and 24 hr aerial and root 
tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(s) Heat Shock Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber with 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 70% humidity and 20°C. Fourteen day old plants were transferred 
to a 42°C growth chamber and aerial tissues were harvested 1 h and 6 h after transfer. 
Control plants were left at 20°C and aerial tissues were harvested. Tissues were flashfrozen 
in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers containing 42°C water for 
treatment. Control plants were treated with water at 25°C. After 1 hr and 6 hr aerial and root 
tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(t) Cold Shock Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were transferred 
to a 4°C dark growth chamber and aerial tissues were harvested 1 hour and 6 hours later. 
Control plants were maintained at 20°C and covered with foil to avoid exposure to light. 
Tissues were flash-frozen in liquid nitrogen and stored at -80°C. 



80°C. 
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Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers containing 4°C water for 
treatment. Control plants were treated with water at 25°C. After 1 hr and 6 hr aerial and root 
tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 




(u) Oxidative Stress- Hydrogen Peroxide Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize, before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were sprayed 
with 5 mM H 2 0 2 (hydrogen peroxide) in a 0.02% Silwett L-77 solution. Control plants were 
sprayed with a 0.02% Silwett L-77 solution. Aerial tissues were harvested 1 hour and 6 hours 
after spraying, flash- frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 5 mM H2O2 for treatment. 
Control plants were treated with water. After 1 hr, 6 hr and 24 hr, aerial and root tissues were 
separated and flash frozen in liquid nitrogen prior to storage at -80°C. 



(v) Nitric Oxide Treatment 

Seeds of Arabidopsis thaliana (Wassilewskija) were sown in trays and left at 4°C for 
three days to vernalize before being transferred to a growth chamber having 16 hr light/8 hr 
dark, 12,000-14,000 LUX, 20°C and 70% humidity. Fourteen day old plants were sprayed 
with 5 mM sodium nitroprusside in a 0.02% Silwett L-77 solution. Control plants were 
sprayed with a 0.02% Silwett L-77solution. Aerial tissues were harvested 1 hour and 6 hours 
after spraying, flash- frozen in liquid nitrogen and stored at -80°C. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
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14,000 LUX. Covered flats were watered every three days for 7 days. Seedlings were 
carefully removed from the sand and placed in 1 -liter beakers with 5 mM nitroprusside for 
treatment. Control plants were treated with water. After 1 hr 3 6 hr and 12 hr, aerial and root 
tissues were separated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(w) S4 Immature Buds, Inflorescence Meristem 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. Inflorescences containing immature floral 
buds [stages 1-12; Smyth et al. ? 1990] as well as the inflorescence meristem were harvested 
and flash frozen in liquid nitrogen. 

(x) S5 Flowers (Opened) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. Mature, unpollinated flowers [stages 12-14; 
Smyth et al. 1990] were harvested and flash frozen in liquid nitrogen. 

(y) S6 Siliques (All Stages) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. Siliques bearing developing seeds 
containing post fertilization through pre-heart stage [0-72 hours after fertilization (HAF)], 
heart- through early curled cotyledon stage [72-120 HAF] and late-curled cotyledon stage 
[>120 HAF] embryos were harvested separately and pooled prior to RNA isolation in a mass 
ratio of 1:1:1. The tissues were then flash frozen in liquid nitrogen. Description of the stages 
of Arabidopsis embryogenesis used were reviewed by Bowman (1994). 

(z) ARABIDOPSIS ENDOSPERM 
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me a/me a Fruits 0-10 mm 

Seeds of Arabidopsis thaliana heterozygous for the fertilization- independent 
endosperml (fiel) [Ohad et al., 1996; ecotype Landsberg erecta (Ler)] were sown in pots and 
left at 4°C for two to three days to vernalize. Kiyosue et al. (1999) subsequently determined 
that fiel was allelic to the gametophytic maternal effect mutant medea (Grossniklaus et al., 
1998). Imbibed seeds were then transferred to a growth chamber. Plants were grown under 
long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% humidity, 
and 22°C temperature. 1-2 siliques (fruits) bearing developing seeds just prior to dessication 
[9 days after flowering (DAF)] were selected from each plant and were hand-dissected to 
identify wild-type, mea/+ heterozygotes, and mea/mea homozygous mutant plants. At this 
stage, homozygous mea/mea plants produce short siliques that contain >70% aborted seed 
and can be distinguished from those produced by wild-type (100% viable seed) and mea/+ 
heterozygous (50% viable seed) plants (Ohad et al., 1996; Grossniklaus et ah, 1998; Kiyosue 
et al., 1999). Siliques 0-10 mm in length containing developing seeds 0-9 DAF produced by 
homozygous mea/mea plants were harvested and flash frozen in liquid nitrogen. 

Pods 0-10 mm (Control Tissue for Sample 70) 

Seeds of Arabidopsis thaliana heterozygous for the fertilization- independent 
endosperml (fiel) [Ohad et al., 1996; ecotype Landsberg erecta (her)] were sown in pots and 
left at 4°C for two to three days to vernalize. Kiyosue et al. (1999) subsequently determined 
that fiel was allelic to the gametophytic maternal effect mutant medea (Grossniklaus et al, 
1998). Imbibed seeds were then transferred to a growth chamber. Plants were grown under 
long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light intensity, 70% humidity, 
and 22°C temperature. 1-2 siliques (fruits) bearing developing seeds just prior to dessication 
[9 days after flowering (DAF)] were selected from each plant and were hand-dissected to 
identify wild-type, mea/+ heterozygotes, and mea/mea homozygous mutant plants. At this 
stage, homozygous mea/mea plants produce short siliques that contain >70% aborted seed 
and can be distinguished from those produced by wild-type (100% viable seed) and mea/+ 
heterozygous (50% viable seed) plants (Ohad et al., 1996; Grossniklaus et al., 1998; Kiyosue 
et al., 1999). Siliques 0-10 mm in length containing developing seeds 0-9 DAF produced by 
segregating wild-type plants were opened and the seeds removed. The remaining tissues 



(pods minus seed) were harvested and flash frozen in liquid nitrogen. 
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(aa) ARABIDOPSIS SEEDS 
Fruits (pod + seed) 0-5 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
seeds were selected from at least 3 plants and were hand-dissected to determine what 
developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
embryonic stage. Siliques 0-5 mm in length containing post fertilization through pre-heart 
stage [0-72 hours after fertilization (HAF)] embryos were harvested and flash frozen in liquid 
nitrogen. 

Fruits(pod 4- seed) 5-10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
seeds were selected from at least 3 plants and were hand-dissected to determine what 
developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
embryonic stage. Siliques 5-10 mm in length containing heart- through early upturned-U- 
stage [72-120 hours after fertilization (HAF)] embryos were harvested and flash frozen in 
liquid nitrogen. 

Fruits(pod + seed) >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
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seeds were selected from at least 3 plants and were hand-dissected to determine what 
developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
5 embryonic stage. Siliques >10 mm in length containing green, late upturned-U- stage [>120 
hours after fertilization (HAF)-9 days after flowering (DAF)] embryos were harvested and 
flash frozen in liquid nitrogen. 




10 Green Pods 5-10 mm (Control Tissue for Samples 72-74) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
P intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
J'5 seeds were selected from at least 3 plants and were hand-dissected to determine what 
J* developmental stage(s) were represented by the enclosed embryos. Description of the stages 
Rj of Arabidopsis embryogenesis used in this determination were summarized by Bowman 

(1994). Silique lengths were then determined and used as an approximate determinant for 
i embryonic stage. Green siliques 5-10 mm in length containing developing seeds 72-120 

V*\ 

§£> hours after fertilization (HAF)] were opened and the seeds removed. The remaining tissues 
(green pods minus seed) were harvested and flash frozen in liquid nitrogen. 

Green Seeds from Fruits >10 mm 

25 Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 

4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
seeds were selected from at least 3 plants and were hand-dissected to determine what 

30 developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
embryonic stage. Green siliques >10 mm in length containing developing seeds up to 9 days 
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after flowering (DAF)] were opened and the seeds removed and harvested and flash frozen in 
liquid nitrogen. 



Brown Seeds from Fruits >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
seeds were selected from at least 3 plants and were hand-dissected to determine what 
developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
embryonic stage. Yellowing siliques >10 mm in length containing brown, dessicating seeds 
>1 1 days after flowering (DAF)] were opened and the seeds removed and harvested and flash 
frozen in liquid nitrogen. 



Green/Brown Seeds from Fruits >10 mm 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature. 3-4 siliques (fruits) bearing developing 
seeds were selected from at least 3 plants and were hand-dissected to determine what 
developmental stage(s) were represented by the enclosed embryos. Description of the stages 
of Arabidopsis embryogenesis used in this determination were summarized by Bowman 
(1994). Silique lengths were then determined and used as an approximate determinant for 
embryonic stage. Green siliques >10 mm in length containing both green and brown seeds 
>9 days after flowering (DAF)] were opened and the seeds removed and harvested and flash 
frozen in liquid nitrogen. 
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Mature Seeds (24 hours after imbibition) 

Mature dry seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown onto 
moistened filter paper and left at 4°C for two to three days to vernalize. Imbibed seeds were 
then transferred to a growth chamber [16 hr light: 8 hr dark conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature], the emerging seedlings harvested after 48 
hours and flash frozen in liquid nitrogen. 



Mature Seeds (Dry) 

Seeds of Arabidopsis thaliana (ecotype Wassilewskija) were sown in pots and left at 
4°C for two to three days to vernalize. They were then transferred to a growth chamber. 
Plants were grown under long-day (16 hr light: 8 hr dark) conditions, 7000-8000 LUX light 
intensity, 70% humidity, and 22°C temperature and taken to maturity. Mature dry seeds are 
collected, dried for one week at 28°C, and vernalized for one week at 4°C before used as a 
source of RNA. 

Ovules 

Seeds of Arabidopsis thaliana heterozygous for pistillata (pi) [ecotype Landsberg 
erecta (Ler)] were sown in pots and left at 4°C for two to three days to vernalize. They were 
then transferred to a growth chamber. Plants were grown under long-day (16 hr light: 8 hr 
dark) conditions, 7000-8000 LUX light intensity, 76% humidity, and 24°C temperature. 
Inflorescences were harvested from seedlings about 40 days old. The inflorescences were cut 
into small pieces and incubated in the following enzyme solution (pH 5) at room temperature 
for 0.5-1 hr.: 0.2% pectolyase Y-23, 0.04% pectinase, 5 mM MES, 3% Sucrose and MS salts 
(1900 mg/1 KN0 3 , 1650 mg/1 NH4NO3, 370 mg/1 MgS0 4 • 7 H 2 0, 170 mg/1 KH 2 P0 4 , 440 
mg/1 CaCl 2 • 2 H 2 0, 6.2 mg/1 H 2 B0 3 , 15.6 mg/1 MnS0 4 • 4 H 2 0, 8.6 mg/1 ZnS0 4 • 7 H 2 0, 
0.25 mg/1 NaMo0 4 • 2 H 2 0, 0.025 mg/1 CuC0 4 • 5 H 2 0, 0.025 mg/1 CoCl 2 • 6 H 2 0, 0.83 
mg/1 KI, 27.8 mg/1 FeS0 4 • 7 H 2 0, 37.3 mg/1 Disodium EDTA, pH 5.8). At the end of the 
incubation the mixture of inflorescence material and enzyme solution was passed through a 
size 60 sieve and then through a sieve with a pore size of 125 |im. Ovules greater than 125 
(am in diameter were collected, rinsed twice in B5 liquid medium (2500 mg/1 KNO3, 250 mg/1 
MgS0 4 • 7 H 2 0, 150 mg/1 NaH2P04 • H 2 0, 150 mg/1 CaCl 2 • 2 H 2 0, 134 mg/1 (NH4)2 
CaCl 2 • S0 4 , 3 mg/1 H 2 B0 3 , 10 mg/1 MnS0 4 • 4 H 2 0, 2 ZnS0 4 • 7 H 2 0, 0.25 mg/1 NaMo0 4 
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• 2 H 2 0, 0.025 mg/1 CuC0 4 • 5 H 2 0, 0.025 mg/1 CoCl 2 • 6 H 2 0, 0.75 mg/1 KI, 40 mg/1 
EDTA sodium ferric salt, 20 g/1 sucrose, 10 mg/1 Thiamine hydrochloride, 1 mg/1 Pyridoxine 
hydrochloride, 1 mg/1 Nicotinic acid, 100 mg/1 myo-inositol, pH 5.5)), rinsed once in 
deionized water and flash frozen in liquid nitrogen. The supernatant from the 125 Jim sieving 
was passed through subsequent sieves of 50 jam and 32 |im. The tissue retained in the 32 |im 
sieve was collected and mRNA prepared for use as a control. 



(bb) Herbicide treament 

Arabidopsis thaliana (Ws) seeds were sterilized for 5 min. with 30% bleach, 50 ^il 
Triton in a total volume of 50 ml. Seeds were vernalized at 4°C for 3 days before being plated 
onto GM agar plates at a density of about 144 seeds per plate. Plates were incubated in a 
Percival growth chamber having 16 hr light/8 hr dark, 80% relative humidity, 22 °C and 
11,000 LUX for 14 days. 

Plates were sprayed (-0.5 mis/plate) with water, Finale (1.128 g/L), Glean (1.88 g/L), 
RoundUp (0.01 g/L) or Trimec (0.08 g/L). Tissue was collected and flash frozen in liquid 
nitrogen at the following time points: 0, 1, 2, 4 (EXPT REP: 107871 (Finale), 107881 
(Glean), 107896 (Round-up) and 107886 (Trimec)), 8, 12(EXPT REP: 108467 (Finale), 
108468 (Glean), 108465 (Round-up) and 108466, 107891 (Trimec)), and 24 hours. Frozen 
tissue was stored at -80°C prior to RNA isolation. 

(cc) Ap2 

Seeds of Arabidopsis thaliana (ecotype Landesberg erecta) and floral mutant apetala2 
(Jofiiku et al., 1994, Plant Cell 6:121 1-1225) were sown in pots and left at 4°C for two to 
three days to vernalize. They were then transferred to a growth chamber. Plants were grown 
under long-day (16 hr light, 8 hr dark) conditions 7000-8000 LUX light intensity, 70% 
humidity and 22 °C temperature. Inflorescences containing immature floral buds (stages 1-7; 
Bowman, 1994) as wel as the inflorescence meristem were harvested and flashfrozen. 
Polysomal polyA+ RNA was isolated from tissue according to Cox and Goldberg, 1988). 

(dd) Protein Degradation 

Arabidopsis thaliana (ecotype Ws) wild-type and 13B12-1 (homozygous) mutant 
seed were sown in pots containing Metro-mix 350 soil and incubated at 4°C for four days. 
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Vernalized seeds were germinated in the greenhouse (16 hr light/8 hr dark) over a 7 day 
period. Mutant seedlings were sprayed with 0.02% (active ingredient) Finale to confirm their 
transgenic standing. Plants were grown until the mutant phenotype (either multiple pistils in a 
single flower and/or multiple branching per node) was apparent. Young inflorescences 
5 immediately forming from the multiple-branched stems were cut and flash frozen in liquid 
nitrogen. Young inflorescences from wild-type plants grown in parallel and under identical 
conditions were collected as controls. All collected tissue was stored at -80°C until RNA 
isolation. (EXPT REP 108451) 



10 (ee) Root tips 

Seeds of Arabidopsis thaliana (ecotye Ws) were placed on MS plates and vernalized 
at 4°C for 3 days before being placed in a 25°C growth chamber having 16 hr light/8 hr dark, 
^ 70% relative humidty and about 3 W/m 2 . After 6 days, young seedlings were transferred to 
*S flasks containing B5 liquid medium, 1% sucrose and 0.05 mg/1 indole-3 -butyric acid. Flasks 
% were incubated at room temperature with 100 rpm agitation. Media was replaced weekly. 

u After three weeks, roots were harvested and incubated for 1 hr with 2% pectinase, 0.2% 

W 

In cellulase, pH 7 before straining through a #80 (Sigma) sieve. The root body material 

* remaining on the sieve (used as the control) was flash frozen and stored at -80°C until use. 

G 1 The material that passed through the #80 sieve was strained through a #200 (Sigma) sieve 

W 

Wd and the material remaining on the sieve (root tips) was flash frozen and stored at -80°C until 

jt 

2»; use. Approximately 10 mg of root tips were collected from one flask of root culture. 

Seeds of maize hybrid 35A (Pioneer) were sown in water-moistened sand in flats (10 
rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a growth 
chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 13,000- 
25 14,000 LUX. Covered flats were watered every three days for 8 days. Seedlings were 

carefully removed from the sand and the root tips (~2 mm long) were removed and flash 
frozen in liquid nitrogen prior to storage at -80°C. The tissues above the root tips (-1 cm 
long) were cut, treated as above and used as control tissue. 



30 



(ff) rtl 

The rtl allele is a variation of rtl rootless 1 and is recessive, 
phenotype have few or no secondary roots. 



Plants displaying the rtl 
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Seed from plants segregating for rtl were sown on sand and placed in a growth 
chamber having 16 hr light/8 hr dark, 13,000 LUX, 70% humidity and 20°C temperature. 
Plants were watered every three days with tap water. Eleven (11) day old seedlings were 
carefully removed from the sand, keeping the roots intact, rt7-type seedlings were separated 
from their wild-type counterparts and the root tissue isolated. Root tissue from normal 
seedlings (control) and rtl mutants were flash frozen in liquid nitrogen and stored at -80°C 
until use. 

(gg) Imbibed seed 

Seeds of maize hybrid 35 A (Pioneer) were sown in water-moistened sand in covered 
flats (10 rows, 5-6 seed/row) and covered with clear, plastic lids before being placed in a 
growth chamber having 16 hr light (25°C)/8 hr dark (20°C), 75% relative humidity and 
13,000-14,000 LUX. One day after sowing, whole seeds were flash frozen in liquid nitrogen 
prior to storage at -80°C. Two days after sowing, embryos and endosperm were isolated and 
flash frozen in liquid nitrogen prior to storage at -80°C. On days 3-6, aerial tissues, roots and 
endosperm were isolated and flash frozen in liquid nitrogen prior to storage at -80°C. 

(hh) Rough Sheath2-R (rs2-R) Mutants (1400-6/S-17) 

This experiment was conducted to identify abnormally expressed genes in the shoot apex 
of rough sheath2-R (rs2-R) mutant plants. rs2 encodes a myb domain DNA binding protein 
that functions in repression of several shoot apical meristem expressed homeobox genes. Two 
homeobox gene targets are known for rs2 repression, rough sheathl, liguleless 3. The 
recessive loss of function phenotype of rs2-R homozygous plants is described in 
Schneeberger et al. 1998 Development 125: 2857-2865. 

The seed stock genetically segregates 1:1 for rs2-R/rs2-R : rs2-R/+ 
Preparation of tissue samples: 160 seedlings pooled from 2 and 3 week old plants grown 
in sand. Growth conditions; Conviron #107 @ 12 hr days/12hr night, 25°C, 75% humidity. 
Shoot apex was dissected to include leaf three and older. (Pictures available upon request). 

1) rough sheath2-R homozygous (mutant) shoot apex 

2) rough sheath2-R heterozygous (wt, control) shoot apex 

(ii) Leaf Mutant 3642: 
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Mutant 3642 is a recessive mutation that causes abnormal leaf development. The 
leaves of mutant 3642 plants are characterized by leaf twisting and irregular leaf shape. 
Mutant 3642 plants also exhibit abnormally shaped floral organs which results in reduced 
fertility. 

Seed segregating for the mutant phenotype was sown in Metro-mix 350 soil 
and grown in a Conviron growth chamber with watering by sub-irrigation twice a 
week. Environmental conditions were set at 20 degrees Celsius, 70% humidity with 
an 8 hour day, 16 hour night light regime. Plants were harvested after 4 weeks of 
growth and the entire aerial portion of the plant was harvested and immediately frozen 
in liquid nitrogen and stored at -80C. Mutant phenotype plants were harvested 
separately from normal phenotype plants, which serve as the control tissue. 




(jj) Flowers (green, white or buds) 

Approximately 10 jal of Arabidopsis thaliana seeds (ecotype Ws) were sown on 350 
soil (containing 0.03% marathon) and vernalized at 4C for 3 days. Plants were then grown at 
room temperature under fluorescent lighting until flowering. Flowers were harvested after 28 
days in three different categories. Buds that had not opened at all and were completely green 
were categorized as "flower buds" (also referred to as green buds by the investigator). Buds 
that had started to open, with white petals emerging slightly were categorized as "green 
flowers" (also referred to as white buds by the investigator). Flowers that had opened 
mostly (with no silique elongation) with white petals completely visible were categorized as 
"white flowers" (also referred to as open flowers by the investigator). Buds and flowers were 
harvested with forceps, flash frozen in liquid nitrogen and stored at -80C until RNA was 
isolated. 



2. Microarray Hybridization Procedures 



Microarray technology provides the ability to monitor mRNA transcript levels of 
thousands of genes in a single experiment. These experiments simultaneously hybridize two 
differentially labeled fluorescent cDNA pools to glass slides that have been previously 
spotted with cDNA clones of the same species. Each arrayed cDNA spot will have a 
corresponding ratio of fluorescence that represents the level of disparity between the 
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respective mRNA species in the two sample pools. Thousands of polynucleotides can be 
spotted on one slide, and each experiment generates a global expression pattern. 

COATING SLIDES 

The microarray consists of a chemically coated microscope slide, referred herein as a "chip" 
with numerous polynucleotide samples arrayed at a high density. The poly-L-lysine coating 
allows for this spotting at high density by providing a hydrophobic surface, reducing the 
spreading of spots of DNA solution arrayed on the slides. Glass microscope slides (Gold 
Seal #3010 manufactured by Gold Seal Products, Portsmouth, New Hampshire, USA) were 
coated with a 0.1%W/V solution of Poly-L-lysine (Sigma, St. Louis, Missouri) using the 
following protocol: 

Slides were placed in slide racks (Shandon Lipshaw #121). The racks were then put in 

chambers (Shandon Lipshaw #121). 

Cleaning solution was prepared: 

70 g NaOH was dissolved in 280 mL ddH20. 

420 mL 95% ethanol was added. The total volume was 700 mL (= 2 X 350 mL); it was 
stirred until completely mixed. 

If the solution remained cloudy, ddH 2 0 was added until clear. 

3. The solution was poured into chambers with slides; the chambers were covered with glass 
lids. The solution was mixed on an orbital shaker for 2 hr. 

4. The racks were quickly transferred to fresh chambers filled with ddH 2 0. They were 
rinsed vigorously by plunging racks up and down. 

Rinses were repeated 4X with fresh ddH 2 0 each time, to remove all traces of NaOH- 
ethanol. 

5. Polylysine solution was prepared: 

70 mL poly-L-lysine + 70 mL tissue culture PBS in 560 mL water, using plastic 
graduated cylinder and beaker. 

6. Slides were transferred to polylysine solution and shaken for 1 hr. 

7. The rack was transferred to a fresh chambers filled with ddH 2 0. It was plunged up and 
down 5X to rinse. 
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8. The slides were centrifiiged on microtiter plate carriers (paper towels were placed below 
the rack to absorb liquid) for 5 min. @ 500 rpm. The slide racks were transferred to empty 
chambers with covers. 

9. Slide racks were dried in a 45C oven for 10 min. 

10. The slides were stored in a closed plastic slide box. 

11. Normally, the surface of lysine coated slides was not very hydrophobic immediately after 
this process, but became increasingly hydrophobic with storage. A hydrophobic surface 
helped ensure that spots didn't run together while printing at high densities. After they aged 
for 10 days to a month the slides were ready to use. However, coated slides that have been 
sitting around for long periods of time were usually too old to be used. This was because 
they developed opaque patches, visible when held to the light, and these resulted in high 
background hybridization from the fluorescent probe. 

Alternativey, precoated glass slides were purchased from TeleChem Internation, Inc. 
(Sunnyvale, CA, 94089; catalog number SMM-25, Superamine substrates). 

PCR AMPLIFICATION OF CDNA CLONE INSERTS 
Polynucleotides were amplified from Arabidopsis cDNA clones using insert specific probes. 
The resulting lOOuL PCR reactions were purified with Qiaquick 96 PCR purification 
columns (Qiagen, Valencia, California, USA) and eluted in 30 uL of 5mM Tris. 8.5uL of the 
elution were mixed with 1.5uL of 20X SSC to give a final spotting solution of DNA in 3X 
SSC. The concentrations of DNA generated from each clone varied between 10-100 ng/ul, 
but were usually about 50 ng/ul. 

ARRAYING OF PCR PRODUCTS ON GLASS SLIDES 

PCR products from cDNA clones were spotted onto the poly-L-Lysine coated glass slides 
using an arrangement of quill-tip pins (ChipMaker 3 spotting pins; Telechem, International, 
Inc., Sunnyvale, California, USA) and a robotic arrayer (PixSys 3500, Cartesian 
Technologies, Irvine, California, USA). Around 0.5 nl of a prepared PCR product was 
spotted at each location to produce spots with approximately lOOum diameters. Spot center- 
to-center spacing was from 180 urn to 210um depending on the array. Printing was 
conducted in a chamber with relative humidity set at 50%. 



Slides containing maize sequences were purchased from Agilent Technology (Palo 
Alto, CA 94304). 
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POST-PROCESSING OF SLIDES 

After arraying, slides were processed through a series of steps - rehydration, UV cross- 
linking, blocking and denaturation - required prior to hybridization. Slides were rehydrated 
by placing them over a beaker of warm water (DNA face down), for 2-3 sec, to distribute the 
DNA more evenly within the spots, and then snap dried on a hot plate (DNA side, face up). 
The DNA was then cross-linked to the slides by UV irradiation (60-65mJ; 2400 Stratalinker, 
Stratagene, La Jolla, California, USA). 

Following this a blocking step was performed to modify remaining free lysine groups, and 
hence minimize their ability to bind labeled probe DNA. To achieve this the arrays were 
placed in a slide rack. An empty slide chamber was left ready on an orbital shaker. The rack 
was bent slightly inwards in the middle, to ensure the slides would not run into each other 
while shaking. The blocking solution was prepared as follows: 

3x 350-ml glass chambers (with metal tops) were set to one side, and a large round Pyrex 
dish with dH 2 0 was placed ready in the microwave. At this time, 15ml sodium borate was 
prepared in a 50 ml conical tube. 

6-g succinic anhydride was dissolved in approx. 325-350 mL l-methyl-2-pyrrolidinone. 
Rapid addition of reagent was crucial. 

a. Immediately after the last flake of the succinic anhydride dissolved, the 15-mL sodium 
borate was added. 

b. Immediately after the sodium borate solution mixed in, the solution was poured into an 
empty slide chamber. 

c. The slide rack was plunged rapidly and evenly in the solution. It was vigorously shaken up 
and down for a few seconds, making sure slides never left the solution. 

d. It was mixed on an orbital shaker for 15-20 min. Meanwhile, the water in the Pyrex dish 
(enough to cover slide rack) was heated to boiling. 

Following this, the slide rack was gently plunge in the 95C water (just stopped boiling) for 2 
min. Then the slide rack was plunged 5X in 95% ethanol. The slides and rack were 
centrifuged for 5 min. @ 500 rpm. The slides were loaded quickly and evenly onto the 
carriers to avoid streaking. The arrays were used immediately or store in slide box. 
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The Hybridization process began with the isolation of mRNA from the two tissues (see 
"Isolation of total RNA " and "Isolation of mRNA below) in question followed by their 
conversion to single stranded cDNA (see "Generation of probes for hybridization below). 
The cDNA from each tissue was independently labeled with a different fluorescent dye and 
then both samples were pooled together. This final differentially labeled cDNA pool was then 
placed on a processed microarray and allowed to hybridize (see "Hybridization and wash 
conditions", below). 



ISOLATION OF TOTAL RNA 

Approximately 1 g of plant tissue was ground in liquid nitrogen to a fine powder and 
transferred into a 50-ml centrifuge tube containing 10 ml of Trizol reagent. The tube was 
vigorously vortexed for 1 min and then incubated at room temperature for 10-20 min. on an 
orbital shaker at 220 rpm. Two ml of chloroform was added to the tube and the solution 
vortexed vigorously for at least 30-sec before again incubating at room temperature with 
shaking. The sample was then centrifuged at 12,000 X g (10,000 rpm) for 15-20 min at 4°C. 
The aqueous layer was removed and mixed by inversion with 2.5 ml of 1 .2 M NaCl/0.8 M 
Sodium Citrate and 2.5 ml of isopropyl alcohol added. After a 10 min. incubation at room 
temperature, the sample was centrifuged at 12,000 X g (10,000 rpm) for 15 min at 4°C. The 
pellet was washed with 70% ethanol, re-centrifiiged at 8,000 rpm for 5 min and then air dried 
at room temperature for 10 min. The resulting total RNA was dissolved in either TE (10 mM 
Tris-HCl, 1 mM EDTA, pH 8.0) or DEPC (diethylpyrocarbonate) treated deionized water 
(RNAse-free water). For subsequent isolation of mRNA using the Qiagen kit, the total RNA 
pellet was dissolved in RNAse-free water. 

ISOLATION OF mRNA 

mRNA was isolated using the Qiagen Oligotex mRNA Spin-Column protocol 
(Qiagen, Valencia,California). Briefly, 500 \i\ OBB buffer (20 mM Tris-Cl, pH 7.5, 1 M 
NaCl, 2 mM EDTA, 0.2% SDS) was added to 500 jml of total RNA (0.5 - 0.75 mg) and 
mixed thoroughly. The sample was first incubated at 70°C for 3 min, then at room 
temperature for 10 minutes and finally centrifuged for 2 min at 14,000 - 18,000 X g. The 
pellet was resuspended in 400 |il OW2 buffer (10 mM Tris-Cl, pH 7.5, 150 mM NaCl, 1 mM 
EDTA) by vortexing, the resulting solution placed on a small spin column in a 1.5 ml RNase- 
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free microcentrifuge tube and centrifuged for 1 min at 14,000 - 18,000 X g. The spin column 
was transferred to a new 1 .5 ml RNase-free microcentrifuge tube and washed with 400 jal of 
OW2 buffer. To release the isolated mRNA from the resin, the spin column was again 
transferred to a new RNase-free 1.5 ml microcentrifuge tube, 20-100 ^il 70°C OEB buffer (5 
mM Tris-Cl, pH 7.5) added and the resin resuspended in the resulting solution via pipeting. 
The mRNA solution was collected after centrifuging for 1 min at 14,000 - 18,000 X g. 

Alternatively, mRNA was isolated using the Stratagene Poly(A) Quik mRNA 
Isolation Kit (Startagene, La Jolla, California). Here, up to 0.5 mg of total RNA (maximum 
volume of 1 ml) was incubated at 65°C for 5 minutes, snap cooled on ice and 0.1X volumes 
of 10X sample buffer (lOmM Tris-HCl (pH 7.5), 1 mM EDTA (pH 8.0) 5 M NaCl) added. 
The RNA sample was applied to a prepared push column and passed through the column at a 
rate of -1 drop every 2 sec. The solution collected was reapplied to the column and collected 
as above. 200 ^1 of high salt buffer (10 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.5 NaCl) was 
applied to the column and passed through the column at a rate of -1 drop every 2 sec. This 
step was repeated and followed by three low salt buffer (10 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.1 M NaCl) washes preformed in a similar manner. mRNA was eluted by applying 
to the column four separate 200 jal aliquots of elution buffer (10 mM Tris-HCl (pH 7.5), 1 
mM EDTA) preheated to 65°C. Here, the elution buffer was passed through the column at a 
rate of 1 drop/sec. The resulting mRNA solution was precipitated by adding 0.1X volumes of 
10X sample buffer, 2,5 volumes of ice-cold 100% ethanol, incubating overnight at -20°C and 
centrifuging at 14,000-18,000 X g for 20-30 min at 4°C. The pellet was washed with 70% 
ethanol and air dried for 10 min. at room temperature before resuspension in RNase-free 
deionized water. 

PREPARATION OF YEAST CONTROLS 

Plasmid DNA was isolated from the following yeast clones using Qiagen filtered 
maxiprep kits (Qiagen, Valencia, California): YAL022c(Fun26), YAL031c(Fun21), 
YBR032w, YDL131w, YDL182w, YDL194w, YDL196w, YDR050c and YDR116c. 
Plasmid DNA was linearized with either BsrBl (YAL022c(Fun26), YAL031c(Fun21), 
YDL131w, YDL182w, YDL194w, YDL196w, YDR050c) or Afllll (YBR032w, YDR116c) 
and isolated. 



In Vitro Transcription of Yeast Clones 
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The following solution was incubated at 37°C for 2 hours: 17 jil of isolated yeast 
insert DNA (1 fig), 20 \il 5X buffer, 10 jal 100 mM DTT, 2.5 jal (100 U) RNasin, 20 jil 2.5 
mM (ea.) rNTPs, 2.7 |il (40U) SP6 polymerase and 27.8 (il RNase-free deionized water. 2 \i\ 
(2 U) Ampli DNase I was added and the incubation continued for another 15 min. 10 5M 
5 NH4OAC and 100 jil phenol:chloroform:isoamyl alcohol (25:24:1) were added, the solution 
vortexed and then centrifuged to separate the phases. To precipitate the RNA, 250 jil ethanol 
was added and the solution incubated at -20°C for at least one hour. The sample was then 
centrifuged for 20 min at 4°C at 14,000-18,000 X g, the pellet washed with 500 ^1 of 70% 
ethanol, air dried at room temperature for 10 min and resuspended in 100 jal of RNase-free 
10 deionized water. The precipitation procedure was then repeated. 

Alternatively, after the two-hour incubation, the solution was extracted with 
phenol/chloroform once before adding 0.1 volume 3M sodium acetate and 2.5 volumes of 
100% ethanol. The solution was centrifuged at 15,000rpm, 4°C for 20 minutes and the pellet 
yp resuspended in RNase-free deionized water. The DNase I treatment was carried out at 37°C 
i5 for 30 minutes using 2 U of Ampli DNase I in the following reaction condition: 50 mM Tris- 
Jr! HC1 (pH 7.5), 10 mM MgCl 2 . The DNase I reaction was then stopped with the addition of 

tH NH4OAC and phenol:chloroform:isoamyl alcohol (25:24: 1), and RNA isolated as described 

ffi 

g above. 

Jj; 0.15-2.5 ng of the in vitro transcript RNA from each yeast clone were added to each 

UP plant mRNA sample prior to labeling to serve as positive (internal) probe controls. 

4:: 

N GENERATION OF PROBES FOR HYBRIDIZATION 

Generation of labeled probes for hybridization from first-strand cDNA 

Hybridization probes were generated from isolated mRNA using an Atlas™ Glass 

25 Fluorescent Labeling Kit (Clontech Laboratories, Inc., Palo Alto, California, USA). This 
entails a two step labeling procedure that first incorporates primary aliphatic amino groups 
during cDNA synthesis and then couples fluorescent dye to the cDNA by reaction with the 
amino functional groups. Briefly, 5 \ig of oligo(dT)i 8 primer d(TTTTTTTTTTTTTTTTTTV) 
was mixed with Poly A+ mRNA (1.5 - 2 jag mRNA isolated using the Qiagen Oligotex 

30 mRNA Spin-Column protocol or-the Stratagene Poly(A) Quik mRNA Isolation protocol 

(Stratagene, La Jolla, California, USA)) in a total volume of 25 |il. The sample was incubated 
in a thermocycler at 70°C for 5 min, cooled to 48°C and 10 |il of 5X cDNA Synthesis Buffer 
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(kit supplied), 5 |al 10X dNTP mix (dATP, dCTP, dGTP, dTTP and aminoallyl-dUTP; kit 
supplied), 7.5 fil deionized water and 2.5 jal MMLV Reverse Transcriptase (500U) added. 
The reaction was then incubated at 48°C for 30 minutes, followed by Ihr incubation at 42°C. 
At the end of the incubation the reaction was heated to 70°C for 10 min, cooled to 37°C and 
5 0.5 |il (5 U) RNase H added, before incubating for 15 min at 37°C. The solution was 

vortexed for 1 min after the addition of 0.5 |al 0.5 M EDTA and 5 |al of QuickClean Resin (kit 
supplied) then centrifuged at 14,000-18,000 X g for 1 min. After removing the supernatant to 
a 0.45 |um spin filter (kit supplied), the sample was again centrifuged at 14,000-18,000 X g 
for 1 min, and 5.5 jal 3 M sodium acetate and 137.5 jal of 100% ethanol added to the sample 
10 before incubating at -20°C for at least 1 hr. The sample was then centrifuged at 14,000- 

18,000 X g at 4°C for 20 min, the resulting pellet washed with 500 ^1 70% ethanol, air-dried 
at room temperature for 10 min and resuspended in 10 |il of 2X fluorescent labeling buffer 
(kit provided). 10 jal each of the fluorescent dyes Cy3 and Cy5 (Amersham Pharmacia 
£ j (Piscataway, New Jersey, USA); prepared according to Atlas™ kit directions of Clontech) 

were added and the sample incubated in the dark at room temperature for 30 min. 
% The fluorescently labeled first strand cDNA was precipitated by adding 2 \i\ 3M 

£ sodium acetate and 50 \xl 100% ethanol, incubated at -20°C for at least 2 hrs, centrifuged at 
i 14,000-18,000 X g for 20 min, washed with 70% ethanol, air-dried for 10 min and dissolved 
t; in 100 \\X of water. 



3 control and 3 ug oligo dTV (TTTTTTTTTTTTTTTTTT(A/C/G); Sequence ID No.: X) were 
mixed in a total volume of 24.7 jal. The sample was incubated in a thermocycler at 70°C for 
10 min. before chilling on ice. To this, 8 \i\ of 5X first strand buffer (Superscript II RNase H- 
Reverse Transcriptase kit from Invitrogen (Carlsbad, California 92008); cat no. 18064022), 
25 0.8 °C of aa-dUTP/dNTP mix (SOX; 25mM dATP, 25mM dGTP, 25mM dCTP, 15mM dTTP, 
lOmM aminoallyl-dUTP), 4 (al of 0.1 M DTT and 2.5 [i\ (500 units) of Superscript R.T.II 
enzyme (Stratagene) were added. The sample was incubated at 42°C for 2 hours before a 
mixture of 10 °C of 1M NaOH and 10°C of 0.5 M EDTA were added. After a 15 minute 
incubation at 65°C, 25 |il of 1 M Tris pH 7.4 was added. This was mixed with 450 jal of water 
30 in a Microcon 30 column before centrifugation at 1 1,000 X g for 12 min. The column was 
washed twice with 450 |il (centrifugation at 1 1,000 g, 12 min.) before eluting the sample by 




Alternatively, 3-4 jig mRNA, 2.5 (-8.9 ng of in vitro translated mRNA) |il yeast 
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inverting the Microcon column and centrifuging at 1 1,000 X g for 20 seconds. Sample was 
dehydrated by centrifiigation under vacuum and stored at -20°C. 

Each reaction pellet was disolved in 9 |il of 0.1 M carbonate buffer (0.1M sodium 
carbonate and sodium bicarbonate, pH=8.5-9) and 4.5 jil of this placed in two microfuge 
tubes. 4.5 jil of each dye (in DMSO) were added and the mixture incubated in the dark for 1 
hour. 4.5 (il of 4 M hydroxylamine was added and again incubated in the dark for 15 minutes. 

Regardless of the method used for probe generation, the probe was purified using a 
Qiagen PCR cleanup kit (Qiagen, Valencia, California, USA), and eluted with 100 ul EB (kit 
provided). The sample was loaded on a Microcon YM-30 (Millipore, Bedford, 
Massachusetts, USA) spin column and concentrated to 4-5 ul in volume. Probes for the 
maize microarrays were generated using the Fluorescent Linear Amplification Kit (cat. No. 
G2556A) from Agilent Technologies (Palo Alto, CA). 

HYBRIDIZATION AND WASH CONDITIONS 

The following Hybridization and Washing Condition were developed: 

Hybridization Conditions : 

Labeled probe was heated at 95°C for 3 min and chilled on ice. Then 25 C3L of the 
hybridization buffer which was warmed at 42C was added to the probe, mixing by pipetting, 
to give a final concentration of: 

50% formamide 

4x SSC 

0.03% SDS 
5x Denhardt's solution 

0.1 |ig/ml single-stranded salmon sperm DNA 

The probe was kept at 42C. Prior to the hybridization, the probe was heated for 1 
more min., added to the array, and then covered with a glass cover slip. Slides were placed in 
hybridization chambers (Telechem, Sunnyvale, California) and incubated at 42°C overnight. 

Washing Conditions : 
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A. Slides were washed in Ix SSC + 0.03% SDS solution at room temperature for 5 
minutes, 

B. Slides were washed in 0.2x SSC at room temperature for 5 minutes, 

C. Slides were washed in 0.05x SSC at room temperature for 5 minutes. 

After A, B, and C, slides were spun at 800 x g for 2 min. to dry. They were then 
scanned. 

Maize microarrays were hybridized according to the instructions included Fluorescent 
Linear Amplification Kit (cat. No. G2556A) from Agilent Technologies (Palo Alto, CA). 

SCANNING OF SLIDES 

The chips were scanned using a ScanArray 3000 or 5000 (General Scanning, 
Watertown, Massachusetts, USA). The chips were scanned at 543 and 633nm, at 10 urn 
resolution to measure the intensity of the two fluorescent dyes incorporated into the samples 
hybridized to the chips. 
DATA EXTRACTION AND ANALYSIS 

The images generated by scanning slides consisted of two 16-bit TIFF images 
representing the fluorescent emissions of the two samples at each arrayed spot. These images 
were then quantified and processed for expression analysis using the data extraction software 
Imagene ™ (Biodiscovery, Los Angeles, California, USA). Imagene output was subsequently 
analyzed using the analysis program Genespring™ (Silicon Genetics, San Carlos, California, 
USA). In Genespring, the data was imported using median pixel intensity measurements 
derived from Imagene output. Background subtraction, ratio calculation and normalization 
were all conducted in Genespring. Normalization was achieved by breaking the data in to 32 
groups, each of which represented one of the 32 pin printing regions on the microarray. 
Groups consist of 360 to 550 spots. Each group was independently normalized by setting the 
median of ratios to one and multiplying ratios by the appropriate factor. 

EXAMPLE 4: AFLP EXPERIMENTS AND RESULTS 

Production of Samples 
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mRNA was prepared from 27 plant tissues. Based on preliminary cDNA-AFLP 
analysis with a few primer combinations, 1 1 plant tissues and/or pooled samples were 
selected. Samples were selected to give the greatest representation of unique band upon 
electrophoresis. The final 1 1 samples or pooled samples used in the cDNA-AFLP analysis 
were: 



5 1 Dark adapted seedlings 

52 Roots/Etiolated Seedlings 

53 Mature leaves, soil grown 

54 Immature buds, inflorescence meristem 

55 Flowers opened 

56 Siliques, all stages 

57 Senescing leaves (just beginning to yellow) 

58 Callus Inducing medium 
Callus shoot induction 
Callus root induction 

59 Wounding 

Methyl-j asmonate- treated 

510 Oxidative stress 
Drought stress 
Oxygen Stress-flooding 

5 1 1 Heat treated light grown seedling 
Cold treated light grown seedlings 



cDNA from each of the 1 1 samples was digested with two restriction endonucleases, 
namely Taql and Msel. Taql and Msel adapters were then ligated to the restriction enzyme 
fragments. Using primers to these adapters that were specific in sequence (i.e. without 
extensions), the restriction fragments were subjected to cycles of non-radioactive pre- 
amplification. 

Selective PCR 

In order to limit the number of fragments or bands on each lane of the AFLP gel, 
fragments were subjected to another round of selective radioactive polymerase chain 
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amplification. The Taql primers used in this amplification were 5 '-labelled with For 
these amplifications, the Taql primers had two extra nucleotides at their 3 5 end and the Msel 
primers had three extra nucleotides at their 3' end. This resulted in 16 primer designs for the 
Taql primer and 64 primer designs for the Msel primer. Altogether, this gave rise to a total of 
1024 primer designs. Fragments generated in this selective amplification protocol were run 
with labeled molecular weight markers on polyacrylamide gels to separate fragments in the 
size range of 100 - 600 nucleotides. 

Following gel electrophoresis, profiles were analyzed with a phosphoimager. From these 
images, electronic files, giving the mobilities of all bands on the gels and their intensities in 
each of the samples, were compiled. 

All unique bands were cut out of the gels. The gel pieces were placed in 96 well 
plates for elution and their plate designation was linked to their electrophoretic mobilities 
recorded in the electronic files. The eluted fragments were then subjected to another round of 
amplification, this time using reamplification primers (see below). After amplification, DNA 
fragments were sequenced. 

A computer database was established linking the mobilities of all the bands observed 
on the cDNA-AFLP gels with the sequence of the correspondingly isolated fragment. The 
sequence allowed for identification of the gene from which the cDNA-AFLP fragment was 
derived, allowing for a linkage of band mobility with the transcript of a specific gene. Also 
linked to the band mobilities were their intensities recorded for each of the eleven samples 
used in constructing the database. 

This cDNA-AFLP analysis with TaqllMsel and 1024 primer combinations was repeated 
using the enzymes Malll in place of Taql, and Csp6l in place of Msel. 

Using the Database for the Transcript Profiling of Experimental Samples 

Experimental Samples were subjected to cDNA-AFLP as described above, resulting 
in electronic files recording band mobilities and intensities. Through use of the database 
established above, band mobilities could be linked to specific cDNAs, and therefore genes. 
Furthermore, the linkage with the intensities in the respective samples allowed for the 
quantification of specific cDNAs in these samples, and thus the relative concentration of 
specific transcripts in the samples, indicating the level to which specific genes were 
expressed. 
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Reamplification primers 
99G24 

CGCCAGGGTTTTCCCAGTCACGACMCG^CrC4CJ|gatgagtcctgagtaa| 
M13 forward +10 MseI+0 

5 

99G20 

AGCGGATAACAATTTCACACAGGA|C4C4CrGG7^|tagactgcgtaccga| 
M13 reverse +10 TaqI +0 

10 Purification of the Reampli fiction reaction before sequencing 

5 ul reamplification reaction 
0,25 ul lOxPCR buffer 

0 0,33 ul Shrimp Alkaline Phosphatase (Amersham Life Science) 
}$ 0,033 ul Exonuclease I (USB) 

f 0,297 ul SAP dilution buffer 
fjS 1,59 ulMQ 

111 

ffi 7.5 ^1 total 

a 

1 j 

So 30' 37°C 

% 10' 80°C 

C 4°C 

Sample Preparation 

25 SI: Dark adapted seedlings ; Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots 
and left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. After 8 days, the seedlings were foil- 
wrapped and harvested after two days. 



30 



S2: Roots/Etiolated seedlings: Seeds of Arabidopsis thaliana (wassilewskija) were 
germinated on solid germination media (IX MS salts, IX MS vitamins, 20g/L sucrose, 50 
mg/L MES pH 5.8) in the dark. Tissues were harvested 14 days later. 
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S3: Mature leaves, soil grown: Seeds of Arabidopsis thaliana (wassilewskija) were sown in 
pots and left at 4°C for two to three days to vernalize. They were transferred to a growth 
chamber after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. Leaves were harvested 17 days later 
from plants that had not yet bolted. 

S4: Immature buds, inflorescence meristem : Seeds of Arabidopsis thaliana (wassilewskija) 
were sown in pots and left at 4°C for two to three days to vernalize. They were transferred to 
a growth chamber after three days. The intensity of light in the growth chamber was 7000- 
8000 LUX, temperature was 22°C, with 16 h light and 8 h dark. 

S5: Flowers, opened: Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and 
left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. 

S6: Siliques, all stages : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and 
left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. 

S7: Senescing leaves (just beginning to yellow): Seeds of Arabidopsis thaliana 
(wassilewskija) were sown in pots and left at 4°C for two to three days to vernalize. They 
were transferred to a growth chamber after three days. The intensity of light in the growth 
chamber was 7000-8000 LUX, temperature was 22°C, with 16 h light and 8 h dark. When 
the plant had leaves that were less than 50% yellow, the leaves that were just beginning to 
yellow were harvested. 



Callus Inducing Medium : Seeds of Arabidopsis thaliana (wassilewskija) were 
surface sterilized (1 min-75% Ethanol, 6 min-bleach 100% + Tween 20, rinse) and incubated 



S8: 
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on MS medium containing 2,4-Dichlorophenoxyacetic acid (2,4-D) 1 mg/1 and Kinetin 1 mg/1 
in the dark for 3 weeks to generate primary callus. 

Hypocotyls and roots of the seedling were swollen after a week after incubation in this callus 
induction medium and subsequently callus was initiated from these swollen areas. 

Callus shoot induction : Primary calluses were transferred to the fresh callus induction 
medium for another 2 weeks growth to generate secondary callus. Secondary callus were 
transferred to shoot induction medium containing MS basal medium and Benzyladenine (BA) 
2 mg/1 and Naphthaleneacetic acid (NAA) ).l mg/1 for 2 weeks growth in the light before it 
was harvested and frozen and sent to Keygene. Many shoot meristems were observed under 
the microscope. 

Callus root induction : Secondary calluses were transferred to root induction medium 
containing MS basal medium, sucrose 1% and Indolebutyric acid (IB A) 0.05 mg/1 in the dark. 
Many root primordia were observed under microscope after 10 days in the root induction 
medium. Those callus tissue were harvested and frozen and sent to Keygene. 




S9: 

Wounding: Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and left 
at 4°C for two to three days to vernalize. They were transferred to a growth chamber after 
three days. The intensity of light in the growth chamber was 7000-8000 LUX, temperature 
was 22°C, with 16 h light and 8 h dark. After 20 days, leaves of plants were wounded with 
pliers. Wounded leaves were harvested I hour and 4 hours after wounding. 



Methyl jasmonate treatment: Seeds of Arabidopsis thaliana (wassilewskija) were 
sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a 
growth chamber after three days. The intensity of light in the growth chamber was 7000- 
8000 LUX, temperature was 22°C, with 16 h light and 8 h dark. After 13 days, plants were 
sprayed with 0.001% methyl jasmonate. Leaves were harvested 1.5 hours and 6 hours after 
spraying 

S10: 

Oxidative stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots 
and left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
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temperature was 22°C, with 16 h light and 8 h dark. After 24 days, a few leaves were 
inoculated with a mixture of 2.5 mM D-glucose, 2.5 U/mL glucose oxidase in 20 mM sodium 
phosphate buffer pH 6.5. After an hour, 3 hours, or 5 hours after inoculation, whole plant, 
except for the inoculated leaves, was harvested. This sample was mixed with sample from 
plants that were sitting in full sun (152,000 LUX) for 2 hours or four hours. 

Drought stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and 
left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. After 20 days, aerial tissues were 
harvested and left to dry in 3MM Whatman paper for 1 hour or 4 hours. 

Oxygen stress : Seeds of Arabidopsis thaliana (wassilewskija) were sown in pots and 
left at 4°C for two to three days to vernalize. They were transferred to a growth chamber 
after three days. The intensity of light in the growth chamber was 7000-8000 LUX, 
temperature was 22°C, with 16 h light and 8 h dark. After 21 days, the plant was flooded by 
immersing its pot in a beaker of tap water. After 6 days, the upper tissues were harvested. 

S 1 1 : Heat-treated light grown seedlings: Seeds of Arabidopsis thaliana (wassilewskija) were 
sown in pots and left at 4°C for two to three days to vernalize. They were transferred to a 
growth chamber after three days. The intensity of light in the growth chamber was 7000- 
8000 LUX, temperature was 22°C, with 16 h light and 8 h dark. Over a 5 hour period, the 
temperature was raised to 42°C at the rate of approximately 4°C per hour. After 1 hour at 
42°C, the aerial tissues were collected. This sample was mixed with an equal volume of 
sample that went through a heat-recovery treatment namely bringing down the temperature to 
22°C from 42°C over a 5 hour period at the rate of 4°C per hour. 

Cold- treated light grown seedlings: Seeds of Arabidopsis thaliana (wassilewskija) 
were sown in pots and left at 4°C for two to three days to vernalize. They were transferred to 
a growth chamber after three days. The intensity of light in the growth chamber was 7000- 
8000 LUX, temperature was 22°C, with 16 h light and 8 h dark. After 18 days, the plant was 
transferred to 4°C for an hour before the aerial tissues were harvested. This sample was 
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mixed with aerial tissues from another plant that was transferred to 4°C for 27 hours before 
being harvested. 

Analysis of Data: 

Intensity : The intensity of the band corresponds to the value in each lane marked Sl 5 S2 etc. 

P- values : The data shows P- values of each of the samples 1-11. P-values are calculated 
using the following formula 2*(l-NORMDIST(ABS(Sx-AVERAGE(of SI to SI 1, not 
including Sx))/STDEV(of SI to Sll not including Sx),0,l,TRUE)) using Excel functions. 

The equivalent mathematical formula of P-value is as follows: 
J cp(x) dx, integrated from a to co, 

where (p(x) is a normal distribution: 
where a = |Sx-|i] 

a(Sl . . .SI 1, not including Sx); 
where |i = is the average of the intensities of all samples except Sx, 
= (£Sl...SnVSx 
n-1 

where a(S 1 ... S 1 1 , not including Sx) = the standard deviation of all sample intensities 
except Sx. 

Results : 

The results are shown in the MA_diff tables. 

EXAMPLE 5: TRANSFORMATION OF CARROT CELLS 

Transformation of plant cells can be accomplished by a number of methods, as 
described above. Similarly, a number of plant genera can be regenerated from tissue culture 
following transformation. Transformation and regeneration of carrot cells as described herein 
is illustrative. 

Single cell suspension cultures of carrot (Daucus carota) cells are established from 
hypocotyls of cultivar Early Nantes in B 5 growth medium (O.L. Gamborg et al., Plant 
Physiol 45:372 (1970)) plus 2,4-D and 15 mM CaCl 2 (B 5 -44 medium) by methods known in 
the art. The suspension cultures are subcultured by adding 10 ml of the suspension culture to 
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40 ml of B 5 -44 medium in 250 ml flasks every 7 days and are maintained in a shaker at 150 
rpm at 27 °C in the dark. 

The suspension culture cells are transformed with exogenous DNA as described by Z. 
Chen et al. Plant Mol Bio. 36:163 (1998). Briefly, 4-days post-subculture cells are incubated 
with cell wall digestion solution containing 0.4 M sorbitol, 2% driselase, 5mM MES (2-[N- 
Morpholino] ethanesulfonic acid) pH 5.0 for 5 hours. The digested cells are pelleted gently 
at 60 xg for 5 min. and washed twice in W5 solution containing 1 54 mM NaCl, 5 mM KC1, 
125 mM CaCl 2 and 5mM glucose, pH 6.0. The protoplasts are suspended in MC solution 
containing 5 mM MES, 20 mM CaCl 2 , 0.5 M mannitol, pH 5.7 and the protoplast density is 
adjusted to about 4 x 10 6 protoplasts per ml. 

15-60 (ig of plasmid DNA is mixed with 0.9 ml of protoplasts. The resulting 
suspension is mixed with 40% polyethylene glycol (MW 8000, PEG 8000), by gentle 
inversion a few times at room temperature for 5 to 25 min. Protoplast culture medium known 
in the art is added into the PEG-DNA-protoplast mixture. Protoplasts are incubated in the 
culture medium for 24 hour to 5 days and cell extracts can be used for assay of transient 
expression of the introduced gene. Alternatively, transformed cells can be used to produce 
transgenic callus, which in turn can be used to produce transgenic plants, by methods known 
in the art. See, for example, Nomura and Komamine, Pit. Phys. 79:988-991 (1985), 
Identification and Isolation of Single Cells that Produce Somatic Embryos in Carrot 
Suspension Cultures. 

EXAMPLE 6: PHENOTYPE SCREENS AND RESULTS 

A: Triparental Mating and Vacuum Infiltration Transformation of Plants 

Standard laboratory techniques are as described in Sambrook et al. (1989) unless 
otherwise stated. Single colonies of Agrobacterium C58ClRif, E. coli helper strain HB101 
and the E. coli strain containing the transformation construct to be mobilized into 
Agrobacterium were separately inoculated into appropriate growth media and stationary 
cultures produced. 100 jal of each of the three cultures were mixed gently, plated on YEB (5g 
Gibco beef extract, lg Bacto yeast extract, lg Bacto peptone, 5g sucrose, pH 7.4) solid 
growth media and incubated overnight at 28°C. The bacteria from the triparental mating were 
collected in 2 ml of lambda buffer (20 mM Tris (pH 7.5), 100 mM NaCl, 10 mM MgCl 2 ) and 
serial dilutions made. An aliquot of the each dilution was then plated and incubated for 2 
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days at 28°C on YEB plates supplemented with 100 ng/ml rifampicin and 100 |ig/ml 
carbenicillin for calculation of the number of acceptor cells and on YEB plates supplemented 
with 100 (ig/ml rifampicin, 100 |ig/ml carbenicillin and 100 (ig/ml spectinomycin for 
selection of transconjugant cells. The cointegrate structure of purified transconjugants was 
verified via Southern blot hybridization. 

A transconjugant culture was prepared for vacuum infiltration by inoculating 1 ml of 
a stationary culture arising from a single colony into liquid YEB media and incubating at 
28°C for approximately 20 hours with shaking (220 rpm) until the OD taken at 600 nm was 
0.8-1.0. The culture was then pelleted (8000 rpm, 10 min, 4°C in a Sorvall SLA 3000 rotor) 
and the bacteria resuspended in infiltration medium (0.5X MS salts, 5% w/v sucrose, 10 jig/1 
BAP, 200 Silwet L-77, pH 5.8) to a final OD 600 of 1.0. This prepared transconjugant 
culture was used within 20 minutes of preparation. 

Wild-type plants for vacuum infiltration were grown in 4-inch pots containing 
Metromix 200 and Osmocote. Briefly, seeds of Arabidopsis thaliana (ecotype 
Wassilewskija) were sown in pots and left at 4°C for two to four days to vernalize. They 
were then transferred to 22-25°C and grown under long-day (16 hr light: 8 hr dark) 
conditions, sub-irrigated with water. After bolting, the primary inflorescence was removed 
and, after four to eight days, the pots containing the plants were inverted in the vacuum 
chamber to submerge all of the plants in the prepared transconjugant culture. Vacuum was 
drawn for two minutes before pots were removed, covered with plastic wrap and incubated in 
a cool room under darkness or very low light for one to two days. The plastic wrap was then 
removed, the plants returned to their previous growing conditions and subsequently produced 
(Tl) seed collected. 

B: Selection of T-DNA Insertion Lines 

Approximately 10,750 seeds from the initial vacuum infiltrated plants were sown per 
flat of Metromix 350 soil. Flats were vernalized for four to five days at 4°C before being 
transferred to 

22-25°C and grown under long-day (16 hr light: 8 hr dark) conditions, sub-irrigated 
with water. Approximately seven to ten days after germination, the (Tl) seedlings were 
sprayed with 0.02% Finale herbicide (AgrEvo). After another five to seven days, herbicide 
treatment was repeated. Herbicide resistant Tl plants were allowed to self-pollinate and T2 
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seed were collected from each individual. In the few cases where the Tl plant produced few 
seed, the T2 seed was planted in bulk, the T2 plants allowed to self-pollinate and T3 seed 
collected. 



C: Phenotype Screening 

Approximately 40 seed from each T2 (or T3) line were planted in a 4-inch pot 
containing either Sunshine mix or Metromix 350 soil. Pots were vernalized for four to five 
days at 4°C before being transferred to 22-25°C and grown under long-day (16 hr light: 8 to- 
dark) conditions, sub-irrigated with water. A first phenotype screen was conducted by 
visually inspecting the seedlings five to seven days after germination and aberrant phenotypes 
noted. Plants were then sprayed with Finale herbicide within four days (i.e. about seven to 
nine days after germination). The second visual screen was conducted on surviving T2 (or 
T3) plants about sixteen to seventeen days after germination and the final screen was 
conducted after the plants had bolted and formed siliques. Here, the third and fourth green 
siliques were collected and aberrant phenotypes noted. The Knock-in and Knock-out Tables 
contain descriptions of identified phenotypes. 

Alternative, seed were surface sterilized and transferred to agar solidified medium 
containing Murashige and Skoog salts (IX), 1% sucrose (wt/v) pH 5.7 before autoclaving. Seed 
were cold treated for 48 hours and transferred to long days [16 hours light and 8 hours dark], 
25°C. Plants were screened at 5 and 10 days. 

In another screen, seed were surface sterilized and transferred to agar solidified medium 
containing Murashige and Skoog salts (IX), and combinations of various nitrogen and sucrose 
amounts as specified below:: 

Medium 1: no sucrose, 20.6 mM NH4NO3, 18.8 mM KN0 3 ; 

Medium 2: 0.5% sucrose, 20.6 mM NH4NO3, 18.8 mM KN0 3 ; 

Medium 3: 3% sucrose, 20.6 mM NH4NO3, 18.8 mM KN0 3 ; 

Medium 4: no sucrose, 20.6 |iM NH4NO3, 18.8 \M KN0 3 ; 

Medium 5: 0.5% sucrose, 20.6 |iM NH4NO3, 18.8 pM KN0 3 ; and 

Medium 6: 3% sucrose, 20.6 jiM NH4NO3, 18.8 jiM KN0 3 . 
The 0.5% sucrose was the control concentration for the sucrose. The low nitrogene, 20.6 jaM 
NH4N0 3 , 18.8 ^iM KNO3, is the control for the nitrogen. Seed were cold treated for 48 hours 
and transferred to long days [16 hours light and 8 hours dark], 25°C. Plants were screened at 2, 
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5, and 10 days. 

D: TAIL-PCR and Fragment Sequencing 

5 Rosette leaves were collected from each putative mutant and crushed between 

parafilm and FTA paper (Life Technologies). Two 2mm 2 hole punches were isolated from 
each FTA sample and washed according to the manufacturer's instructions by vortexing with 
200 ul of the provided FTA purification reagent. The FTA reagent was removed and the 
washing procedure repeated two more times. The sample was then washed twice with 200 ul 
10 of FTA TE (10 mM Tris, 0.1 mM EDTA, pH 8.0) and vortexing prior to PCR. 

Primers used for TAIL-PCR are as follows: 
AD2: 5' NGTCGASWGANAWGAA 3' (128-fold degeneracy) 
C S = G or C, W - A or T, and N = A, G, C, or T 



LB1 : 5' GTTTAACTGCGGCTCAACTGTCT 3' 
LB2: 5' CCCATAGACCCTTACCGCTTTAGTT 3' 
m LB3: 5' GAAAGAAAAAGAGGTATAACTGGTA 3' 

E W 

as 
b 

p The extent to which the left and right borders of the T-DNA insert were intact was 

measured for each line by PCR. The following components were mixed for PCR: 1 2mm 
FTA sample, 38.75 ^1 distilled water, 5 |xl 10X Platinum PCR buffer (Life Technologies), 2 

25 (il 50 mM MgCl 2 , 1 ul 10 mM dNTPs, 1 jxl 10 ^M primer LB1 (or RBI for analysis of the 

right border), 1 \i\ 10 |iM primer LB3R (or RB3R for analysis of the right border) and 1.25 U 
Platinum Taq (Life Technologies). Cycling conditions were: 94°C, 10 sec; thirty cycles of 
94°C, 1 sec. - 54°C, 1 sec. - 72°C, 1 sec; 72°C, 4 sec. The expected band size for an intact left 
border is bp, while an intact right border generates a bp band. 



Fragments containing left or right border T-DNA sequence and adjacent genomic 
DNA sequence were obtained via PCR. First product PCR reactions use the following 
reaction mixture: 1 2mm 2 FTA sample, 12.44 \il distilled water, 2 |al 10X Platinum PCR 
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buffer (Life Technologies), 0.6 ul 50 mM MgCl 2 , 0.4 ul 10 mM dNTPs, 0.4 ul 10 uM primer 
LB1 (or RBI for analysis of the right border), 3 ul 20 uM primer AD2 and 0.8 U Platinum 
Taq (Life Technologies). Cycling conditions for these reactions were: 93°C, 1 min.; 95°C, 1 
min.; three cycles of 94°C, 45 sec. - 62°C, 1 min. - 72°C, 2.5 min.; 94°C,45 sec; 25°C, 3 
5 min.; ramp to 72°C in 3 min.; 72°C, 2.5 min.; fourteen cycles of 94°C, 20 sec. - 68°C, 1 min. 

- 72°C, 2.5 min. - 94°C, 20 sec; - 68°C, 1 min. - 72°C, 2.5 min. - 94°C, 20 sec. - 44°C, 1 min. 

- 72°C, 2.5 min.; 72°C, 5 min.; end; -4.5 hrs. For second product PCR reactions 1 ul of a 1:50 
dilution of the first PCR product reaction was mixed with 13.44 ul distilled water, 2 ul 10X 
Platinum PCR buffer (Life Technologies), 0.6 ul 50 mM MgCl 2 , 0.4 ul 10 mM dNTPs, 0.4 ul 

10 10 uM primer LB2 (or RB2 for analysis of the right border), 2 ul 20 uM primer AD2 and 0.8 
U Platinum Taq (Life Technologies). Second product cycling conditions were: eleven cycles 
of 94°C, 20 sec. - 64°C, 1 min. - 72°C, 2.5 min. - 94°C, 20 sec. - 64°C, 1 min. - 72°C, 2.5 min. 

- 94°C, 20 sec. - 44°C, 1 min.; 72°C, 5 min.; end; ~3 hrs. Third product PCR reactions were 
O prepared by first diluting 2 ul of the second PCR product with 98 ul of distilled water and 
B then adding 1 ul of the dilution to 13.44 ul distilled water, 2 ul 10X Platinum PCR buffer 
| ; (Life Technologies), 0.6 ul 50 mM MgCl 2 , 0.4 ul 1 0 mM dNTPs, 0.4 ul 1 0 uM primer LB3 

few-' 

TU (or RB3 for analysis of the right border), 2 ul 20 pM primer AD2 and 0.8 U Platinum Taq 

Jrl (Life Technologies). Third product cycling conditions were: twenty cycles of 94°C, 38 sec. - 

^ 44°C, 1 min. - 72°C, 2.5 min.; 72°C, 5 min.; end; ~2 hrs. Aliquots of the first, second and 

30 third PCR products were electrophoresed on 1% TAE (40 mM Tris-acetate, 1 mM EDTA) to 

ft \ 

% determine their size. 

Reactions were purified prior to sequencing by conducting a final PCR reaction. Here, 
0.25 pi Platinum PCR Buffer (Life Technologies), 0.1 pi 50 mM MgCl 2 , 3.3 U SAP shrimp 
alkaline phosphatase, 0.33 U Exonuclease and 1.781 pi distilled water were added to a 5 pi 
25 third product and the reaction cycled at 37°C, 30 min.; 80°C, 10 min.; 4°C indefinitely. 

Di-deoxy "Big Dye" sequencing was conducted on Perkin-Elmer 3700 or 377 
machines. 



KNOCK-IN EXPERIMENTS 

For the following examples, a two-component system was constructed in a plant to 
ectopically express the desired cDNA. 
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First, a plant was generated by inserting a sequence encoding a transcriptional 
activator downstream of a desired promoter, thereby creating a first component where the 
desired promoter facilitates expression of the activator generated a plant. The first 
component also is referred to as the activator line. 

Next, the second component is constructed by linking a desired cDNA to a sequence 
that the transcriptional activator can bind to and facilitate expression of the desired cDNA. 
The second component can be inserted into the activator line by transformation. 
Alternatively, the second component can be inserted into a separate plant, also referred to as 
the target line. Then, the target and activator lines can be crossed to generate progeny that 
have both components. 

Two component lines were generated by both means. 

Part I - From crosses 

Target lines containing cDNA constructs are generated using the Agrobacterium-mediated 
transformation. Selected target lines are genetically crossed to activation lines (or promoter 
lines). Generally, the promoter lines used are as described above. Evaluation of phenotypes 
is done on the resulting Fl progenies. 

Part II - From Type I Supertransformation 

Promoter activation lines (generally Vascular/Ovule/Y oung Seed/Embryo line, 
Seed/Epidermis/Ovary/Fruit line, Roots/Shoots/Ovule line, and Vasculature/Meristem are 
transformed with cDNA constructs using the Agrobacterium mediated transformation. 
Selected transformants (and their progenies) are evaluated for changes in phenotypes. The 
table for the knock-in of the Type I supertransformation comprises the following information 

Clone ID, 

Pfam, 

Gemini ID 
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trans. Unique ED (which indicates what promoter activation line was 
transformed 

S Ratio: segregation ratio after the transformed plants are selected for 

the marker. 

Assay 

Stage: phenotype was observed 

Feature: Where the phenotype was observed 

Phenotype 

P Ratio: phenotype ratio 
Comments 



Part III - From Type II Supertransformation 

Target lines generated using the procedure mentioned in Part I are transformed with T-DNA 
construct containing constitutive promoter. Selected transformants (and their progenies) are 
evaluated for changes in phenotypes. 



An additional deposit of an E. coli Library, E. co//LibA021800, was made at the 
American Type Culture Collection in Manassas, Virginia, USA on February 22, 2000 to meet 
the requirements of Budapest Treaty for the international recognition of the deposit of 
microorganisms. This deposit was assigned ATCC accession no. PTA-141 1 . 
Additionaly, ATCC Library deposits; PTA-1 161, PTA-141 1 and PTA-2007 were made at the 
American Type Culture Collection in Manassas, Virginia, USA on; January 7, 2000, 
February 23, 2000 and June 8, 2000 respectively, to meet the requirements of Budapest 
Treaty for the international recognition of the deposit of microorganisms. 

- The invention being thus described, it will be apparent to one of ordinary skill in the 
art that various modifications of the materials and methods for practicing the invention can be 
made. Such modifications are to be considered within the scope of the invention as defined 
by the following claims. 

Each of the references from the patent and periodical literature cited herein is hereby 
expressly incorporated in its entirety by such citation. 
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